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A SYSTEM FOR CELL-BASED SCREENING 



5 Cross Reference 

This application claims priority to U.S. Provisional Applications for Patent 
Serial Nos. 60/122,152 (February 26, 1999), 60/123,399 (March 8, 1999), 09/352,141, 
(July 12, 1999), 60/151,797 (August 31, 1999), 60/168,408 (December 1, 1999); and is 
a continuation in part of 09/430,656 (October 29, 1999); 09/398,965 filed September 
10 17, 1999 which is a continuation in part of Serial No. 09/031,271 filed Febmaiy 27, 
1998 which is a continuation in part of U.S. Application S/N 08/810983, filed on 
February 27, 1997. 

Field of The Invention 

This invention is in the field of fluorescence-based cell and molecular 
biochemical assays for drug discovery. 

Background of the Invention 

20 , 

Drug discovery, as currently practiced in the art, is a long, multiple step process 
involving identification of specific disease targets, development of an assay based on a 
specific target, validation of the assay, optimization and automation of the assay to 
produce a screen, high throughput screening of compound libraries using the assay to 

25 identify "hits", hit validation and hit compound optimization. The ouQ>ut of this 
process is a lead compound that goes into pre-clinical and, if validated, evmtually into 
clinical trials. In this process, the screening phase is distinct fi-om the assay 
development phases, and involves testing compound efficacy in living biological 
systems. 

30 Historically, drug discovery is a slow and costly process, spanning numerous 

years and consuming hundreds of millions of dollars per drag created. Developments 
in the areas of genomics and high throughput screening have resulted in increased 
capacity and efficiency in the areas of target identification and volume of compoimds 
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Historically, dmg discovery is a slow and costly process, spanning numerous 
years and consuming hundreds of millions of dollars per drug created. Developments 
in the areas of genomics and high throughput screening have resulted in increased 
capacity and eifficiency in the areas of target identification and volume of compounds 
screened. Significant advances in automated DNA sequencing, PGR ^iplication, 
positional cloning, hybridization arrays, and bibinfonnatics have greatly increased the 
number of gaies (and gene firagments) encoding potential drug screening targets. 
However, the basic scheme for drag screening remians the same. 

VaUdation of genomic targets ais points for therapeutic intervention using the 
existing methods and protocols has become a bottleneck in the drag discovery process 
due to the slow, manual methods ranployed, such as in vivo functional models, 
functional analysis of recombinant proteins, and stable cell Une expression of candidate 
genes. Primary DNA sequence data acquired through automated sequencing does not . 
permit identification of gene fiinction, but can provide information about common 
"motife" and specific gene homology when compared to known sequence databases. 
Genomic methods such as subtraction hybridization and RADE (rapid amplification of 
differential expression) can be used to identify genes that are up of dbwii regulated in a 
disease state model. However, identification and validation still proceed down the same 
pathway. Some proteomic methods use protein identification (global expression arrays, 
2D electrophoresis, combinatorial libraries) in combination with reverse genetics to 
identify candidate genes of interest. Such putative "disease associated sequehces" or 
DAS isolated as intact cDNA are a great advantage to thbse methods, but they are 
identified by the hundreds without providing aiiy information regarding type, activity, 
and distribution of the encoded protein. Choosing a subset of DAS as drag screening 
targets is "random", and thus extremely inefBcient, without fimctional data to iSrovidd a 
mechanistic link with disease. It is necessary, therefore, to provide new technologies to 
r^idly screen DAS to establish biological fimction, thereby improving target validation 
and candidate optimization in drag discovery. 

There are three major avenues for improving early drug discovery pfodiictivity . 
First, there is a need for tools that provide increased information handling capabiHtj^. 
Bioinformatics has blossomed with the rapid development of DNA sequencing systems 
and the evolution of the genomics database. Genomics is beginning to play a critical 
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role in the identification of potential new targets. Prpteomics has become indispensible 
in relating stnicture and functi order to predipt drug interactiqns. 

However, the next level of biological complexity is the cell. Therefore, there is a need 
to acquire, manage and search multindimensional informationyfrom cells. Secondly, 
5 there is a need for . higher throughput tools. Automation is a key to improving 
productivity as has already been demonstrated in DNA sequencing and high throughput 
primary screening,:^ 1^^^ for automated systems that extract 

multiple parameter information from cells that meet the need for higher throughput 
tools; The instant invention also provides for miniaturizing the, methods, thereby 
10 allowiiig .increased thrpu^put, while decreeing Ae volumes of reagents and test 
compounds reqmred m 

Radioactivity has been the dominant read-out in early drug discovery assays. 
However, the need for more information^ higher throughput md miniaturization has 
caused a shift tow£irds u^ing fluorescence detection. Fluorescence-based reagents can 
15 yield n^pre powgrfu miiltiple parameter assays that are higher :in throughput and 
information contpnt and pqm^ lower volumes of reagents ^ and test compounds. 
Fluorescence is also safer and less e^qjensive than radioactivity-based methods. 

Screeqjng of ceUs freated with dyes smd fluorescent reagents is well known in 
the art. There is a considerable body of literature related to genetic engineering of cells 
20 to produce fluorescent proteins, such as modified, green fluorescent protein (GFP), as a 
reporter molecule. Some properties of wild-type GFP are disclosed by Morise et al. 
{Biochemistry 13 (1974), p. 2656r2662)„and Ward et al. {PKotochem. PhotobioL 31 
(1980), p. 611-615). The GFP of the jellyfish Aequorea victoria Ms an exditation 
maximum at 395 mn and an emission maximum at 510 lun, and does not require an 
25 exogenous factor for fluorescence activity. Uses for GFP disclosed in the literature are 
widespread and mplude the study of gene expression and protein localization (Chalfie 
et al.. Science 263 (1994), p. 12501-12504)), as a tool for visualizmg subcellular 
organelles (Rizzuto et al.. Cum Biology 5 (1995), p. 635-642)), visualization of protein 
transport along the secretory pathway (Kaether and Gerdes, FEBS Letters 369 (1995), 
30 p. 267-271)), expression in plant cells (Hu and.Cheng, FEBS Letters 369 (1995), p. 
331-334)) and Drosophila eijibiyos (Davis et al.. Dev. Biology 170 (1995), p. 726- 
729)), and as a reporter molecule fiised to another protein of interest (U. S. Patent 
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5.491,084). Similarly;°W096/23898 relates to methods of detecting biologically active 
substances affecting iritracellulaif' processes by utilizing a GFP construct having a 
protein kinase activation site. This patent, and all other patents fefei'^ced in this 
appUcation are ihcGiporated 'by reference in their entire . 

Nuinerous references are- related to GFP proteins in biological systems. For 
example, WO 96/09598 describes a Systran for isolating cells of rhteresi utili2iing the 
expression of a GFP Kke protein; WO 96/27675 describes the expresisioh of GFP in 
plants. WO 95/21191 describes modified GFP pifbBin expressed in traiisfoimed 
organisms to detect mutagenesis^ U. S; Pateaats 5,401,629 and 5,436,128 describe 
assays and compositioris for <ietecting ahd^evaluatiiig the intiicelliilaf transduction of 
an extracellular signal liising recombinant cells that express ceir sinface receptois aiiS ' 
contain reporter gene constructs that include trahscriptiori^ regillatdfy eleriieiits thiit are 
responsive to the activity of ceH surface receptors. 

Perfonning a screen on many thousands of cdxnpbimds requires parallel 
15 handling and processing of inahy compounds and assay fcompoiient reagents.' Stkidard 
high throughput screens (VHTS") use mixtures of compounds and biological reaieiits 
along with some indicator compoimd loaded into ari-ays of wells in standard microtiter 
plates with 96 or 384 wellsi The signal ineasiared from each Well^'either fliibr^cence 
emission^ optical density, or radioactivity, integrates the signal from all the material in 
20 the well giving an oyerall population average ofallthe molecules in the well. 

Science .Apphcations Mtematiohal edrpdiriation (SAIC) 130 FifrM Avenue, 
Seattle, WA. , 98109) describes an- imaging f>late reader. This system uses a CCD 
camera to image the whole area of a 96 well plate. The iniage is analyzed to calculate 
the total fluorescence per well for all the material in the well. 

Molecular Devices, Inc. (Sunnyvale, CA) describes a system (FLIPR) which 
uses low angle laser scanning illumination and a mask to selectively excite fluorescence 
within approximately 200 microns of the bottoins of the wells in standard 96 well 
plates in order to reduce background when imaging cell monolayers. This system uses 
a CCD camera to image the whole area of the plate bottom. Aiaiough this system 
measures signals originating from a cell monolayer at the bottom of the well;' the signal 
measured is averaged over the area of the well and is therefore still considered a 
measurement of the average response of a population of cells. The image is analyzed to 
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calculate the total fluorescence per well for cell-based assays. Fluid delivery devices 
have also been incorporated into cell based screening systems, such as the FLIPR 
system, in order to initiate a response, which is then observed as a whole well 
population average response using a macro-imaging system. 

hi contrast to high throughput screens, various high-content screens ("HCS") 
have been developed to address the need for more detailed information about the 
temporal-spatial dynamics of cell constituents and processes. High-content screens 
automate the extraction of multicolor fluorescence information derived from specific 
fluorescence-based reagents incorporated into cells (Giuliano and Taylor (1995), Curr. 
Op. Cell Biol. 7:4; GiuUano et al. (1995) Ann. Rev. Biophys. Biomol. Struct. 24:405). 
Cells are analyzed iising an optical system that can measure spatial, as well as temporal 
dynamics. (Farkas et al. (1993) Ann. Rev. Physiol. 55:785; GiuUano et al. (1990) In 
Optical Microscopy for Biology. B. Herman and K. Jacobson (eds.), pp. 543-557. 
Wiley-Liss, New York; Hahn et al (1992) Nature 359:736; Waggoner et al. (1996) 
Hum. Pathol 27:494). The concept is to treat each cell as a "well" that has spatial and 
temporal information on the activities of the labeled constituents. 

The types of biochemical and molecular information now accessible through 
fluorescence-based reagents applied to cells include ion concentrations, membrane 
potential, specific translocations, enzyme activities, gene expression, as well as the 
presence, amounts and patterns of metabolites, proteins, lipids, carbohydrates, and 
nucleic acid sequences (DeBiasio et al., (1996) Mol. Biol. Cell. 7:1259;Giuliano et al., 
(1995) ^n«. Rev. Biophys. Biomol. Struct. 24:405; Heim and Tsien, (1996) Curr. Biol 
6:178). 

High-content screens can be performed on either fixed cells, using fluorescently 
labeled antibodies, biological Ugands, and/or nucleic acid hybridization probes, or live 
cells using multicolor fluorescent indicators and "biosensors." The choice of fixed or 
live cell screens depends on the specific cell-based assay required. 

Fixed cell assays are the simplest, since an array of initially living cells in a 
microtiter plate format can be treated with various compounds and doses being tested, 
then the cells can be fixed, labeled with specific reagents, and measured. No 
environmental control of the cells is required after fixation. Spatial information is 
acquired, but only at one time point. The availability of thousands of antibodies, 
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ligands and nucleic acid hybridization probes that can be applied to cells makes this an 
attractive approach for many types of cell-based screens. The fixation and labeling 
steps can be automated, allowing efficient processing of assays. 

Live cell assays are more sophisticated and powerful, since an array of living 
cells containing the desired reagents can be screened over time, as well as space. 
Environmental control of the cells (temperature, humidity, and carbon dioxide) is 
required during measurement, since the physiological health of the cells must be 
maintained for multiple fluorescence measurements over time. There is a growing list 
of fluorescent physiological indicators and '"biosensors" that can report changes in 
biochemical and molecular activities within cells (Giuliano et al., (1995) Ann, Rev. 
Biophys. BiomoL Struct. 24:405; Hahn et ah, (1993) In Fluorescent and Luminescent 
Probes for Biological Activity. W.T. Mason, (ed.), pp. 349-359, Academic Press, San 
Diego). 

The availabiUty and use of fluorescence-based reagents has helped to advance 
the development of both fixed and live cell high-content screens. Advances in 
instrumentation to automatically extract multicolor, high-content information has 
recently made it possible to develop HCS into an automated tool. An article by Taylor, 
et al. {American Scientist 80 (1992), p. 322-335) describes many of these methods and 
their applications. For example, ProfEitt et. al. {Cytometry 24: 204-213 (1996)) describe 
a semi-automated fluorescence digital imaging system for quantifying relative cell 
numbers in situ in a variety of tissue culture plate formats, especially 96-weIl microtiter 
plates. The system consists of an epifluorescence inverted microscope with a 
motorized stage, video camera, image intensifier, and a microcomputer with a PC- 
Vision digitizer. Turbo Pascal software controls the stage and scans the plate taking 
multiple images per well. The software calculates total fluorescence per well, provides 
for daily calibration, and configures easily for a variety of tissue culture plate formats. 
Thresholding of digital images and reagents which fluoresce only when taken up by 
living cells are used to reduce background fluorescence without removing excess 
fluorescent reagent; 

Seaming confocal microscope imaging (Go et al., (1997) Analytical 
Biochemistry 247:210-215; Goldman et al., (1995) Experimental Cell Research 
221:311-319) and multiphoton microscope imaging (Denk et al., (1990) Science 
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248:73; Gratton et al., (1994) Proc, of the Microscopical Society of America, pp. 154- 
155) are also well established methods for acquiring high resolution images of 
microscopic samples. The principle advantage of these optical systems is the very 
shallow^ depth of focus, which allows features of limited axial extent to , be resolved 
5 against the background. For example, it is possible to resolve internal cytoplasmic 
features of adherent cells jfrom the features on the cell surface. Because scanning 
multiphoton imaging requires very short duration pulsed laser systems to achieve the 
high photon flux required, fluorescence lifetimes can also be measured in these systems . 
(Lakowicz et al., (1992) Anal Biochem. 202:316-330; Gerrittsen et al. (1997), J. of 
10 Fluorescence 7:11-15)), providing additional capability for different detection modes. 
Small, reliable and relatively inexpensive laser systems, such as laser diode pumped 
lasers, are now available to allow multiphoton confocal microscopy to be ^plied in a 
fairly routine fashion. 

A combination of the biological heterogeneity of cells in populations (Bright, et 
15 al., (1989). X Cell Physiol 141:410; Giuliano, (1996) Cell Motil Cytoskel 35:237)) as 
well as the high spatial and temporal frequency of chemical and molecular information 
present within cells, makes it impossible to extract high-content infomiation from 
populations of cells using existing whole microtiter plate readers. No existing high- 
content screening platform has been designed for multicolor, fluorescence-based 
20 screens using cells that are analyzed individually. Similarly, no method is currently 
available that combines automated fluid delivery to arrays of cells for the purpose of 
systematically screening compounds for the abihty to induce a cellular response that is 
identified by HCS analysis, especially from cells grown in microtiter plates. 
Furthermore, no method exists in the art combining high throughput well-by-well 
25 measurements to ideritify "hits*' in one assay followed by a second high content cell-by- 
cell measurement on the same plate of only those wells identified as hits. 

The instant invention provides systems, methods, and screens that combine high 
throughput screening (HTS) and high content screening (HCS) that significantly 
improve target validation and candidate optimization by combining many cell screening 
30 formats with fluorescence-based molecular reagents and computer-based feature 
extraction, data analysis, and automation, resulting in increased quantity and speed of 

7 



BNSDOCID: <WO 0O5Oa72A2_l_> 



wo 00/50872 



PCT/USOO/04794 



data collection, shortened cycle times, and, ultimately, faster evaluation of promising 
drug, candidates. The instant invention also provides for miniaturizing the methods, 
thereby allowing increased throughput, while decreasing the voliunes of reagents and 
test compounds required in each assay. 

SUMMARY OF THE INVENTION 

In one aspect, the present invention relates to a method for analjrzing cells 
comprising providing cells containing fluorescent reporter molecules in an array of 
locations, treating the cells in the array of locations with one or more reagents, 
imaging numerous cells in each location with fluorescence optics, converting the 
optical inforaiation into digital data, utilizing the digital data to determine the 
distribution, enviroimient or activity of the fluorescently labeled reporter molecules in 
the cells and the distribution of the cells, and interpreting that information in terms of a 
positive, negative or null effect of the compound being tested on the biological 
function 

In this embodiment, the method rapidly determines the distribution, 
environment, or activity of fluorescently labeled reporter molecules in cells for the 
purpose of screening large numbers of compounds for those that specifically affect 
particular biological functions. The array of locations may be a microtiter plate or a 
microchip which is a microplate having cells in an array of locations. In a preferred 
embodiment, the method includes computerized means for acquiring, processing, 
displaying and storing the data received. In a preferred embodiment, the method 
further comprises automated, fluid delivery to the arrays of cells. In another preferred 
embodiment, the information obtained from high throughput measurements on the 
same plate are used to selectively perform high content screening on only a subset of 
the cell locations on the plate. 

In another aspect of the present invention, a cell screening system is provided 
that comprises: 

• a high magnification fluorescence optical system having a microscope 
objective, 
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25 



30 



« an XY stage adapted for holding a plate containing an array of cells and 
having a means for moving the plate for proper alignment and focusing on 
the cell arrays; 

• a digital camera; 

5 "a light source having optical means for directing excitation light to cell 

arrays and a means for directing fluorescent light emitted from the cells to 
the digital camera; and 

• a computer means for receiving and processing digital data from the digital 
camera wherein the computer means includes a digital frame grabber for 

10 receiving the images from the camera, a display for user interaction and 

display of assay results, digital storage media for data storage and archiving, 
and a means for control, acquisition, processing and display of results. 

In a preferred embodiment, the cell screening system fiirther comprises a 
15 computer screen operatively associated with the computer for displaying data. In 
another preferred embodiment, the computer means for receiving and processing digital 
data from the digital camera stores the data in a bioinformatics data base. In a fiuther 
preferred embodiment, the cell screening system further comprises a reader that 
measures a signal from many or all the wells in parallel. In another preferred 
embodiment, the cell screening system fiirther comprises a mechanical-optical means 
for changing the magnification of the system, to allow changing modes between high 
throughput and high content screening. In another preferred embodiment, the cell 
screening system fiirther comprises a chamber and control system to maintain the 
temperature, CO2 concentration and humidity sun-ounding the plate at levels required to 
keep cells alive. In a fiirther prefen-ed embodiment, the cell screening system utilizes a 
confocal scanning illumination and detection system. 

In another aspect of the present invention, a machine readable storage medium 
comprising a program containing a set of instmctions for causing a cell screening 
system to execute procedures for defining the distribution and activity of specific 
cellular constituents and processes is provided. In a preferred embodiment, the cell 
screening system comprises a high magnification fluorescence optical system with a 
stage adapted for holding cells and a means for moving the stage, a digital camera, a 
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light source for receiving and processing the digital data from the digital camera, and a 
computer means for receiving and processing the digital data from the digital camera. 
Preferred embodiments of the machine readable storage medium comprise programs 
consisting of a set of instructions for causing a cell screening system to. execute the 
procedures set forth in Figm-es 9, 11, 12, 13, 14 or 15. Another preferred embodiment 
comprises a program consisting of a, set of instructions for causing a cell screening 
system to execute procedures for detecting the distribution and activity of specific 
cellular constituents and prqcesses, . Jn. m^qst preferred embodiments, the cellular 
processes include, but are not limited to, nuclear translocation of; a protein, cellular 
hypertrophy, apoptosis, and proteaserinduced translocation of a protein. 

In another preferred einbodimenti a variety of alitoiha^^ cell screening methods 
are provided,, including screens to identify cbmpoiihds^ that affect trahsdription factor 
activity, protein kinase activity, cell morphology, microtubule stmcture, apoptosis, 
receptor internalization, and protease-iriduced translocation of a prbteih. 

In a:nother aspect, thei present invention provides recombinant nucleic acids 
encoding a protease biosensor, comprising: 

a. a first nucleic acid sequence that encodes at least one detectable 
polj3)eptide signal; 

b. a secoiid nucleic acid sequence that encodes at least one protease 
recognition site, wherein the second nucleic acid sequence is operatively linked to the 
first nucleic acid sequence that encodes the at least one detectable polypeptide, signal; 
and 

c. a third nucleic acid sequence that encoders at least one reactant target 
sequence, wherein the third nucleic acid sequence is operatively linked to the second 
nucleic acid sequence that encoded the at least one protease recognition site. 

The present invention also provides the recombinant expression vectors capable 
of expressing the recombinant nucleic acids encoding protease biosensors, as well as 
genetically modified host cells that are transfected with the exjjression vectors. 

The invention fiirther provides recombinant protease biosensors, comprising 

a. a first domain comprising at least one detectable polypeptide signal; 

b. a second domain comprising at least' one protease recognition site; and 

c. a third domain comprising at least one reactant target sequence; 
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wherein the first domain and the third domain are separated by the second 
domain. 

In a further aspect, the present invention involves assays and reagents for 
characterizing a sample for the presence of a toxin. The method comprises the use of 
5 detector, classifier, and. identifier classes of toxin biosensors to provide for various 
. levels of toxin characterization. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shovi^s a diagram of the components of the cell-based scanning system. 
10 Figure 2 shows a schematic of the microscope subassembly. 
Figure 3 shows the camera subassembly. 
Figure 4 illustrates cell scanning system process. 

Figure 5 illustrates a user interface showing major functions to guide the user. 
Figure 6 is a block diagram of the two platform architecture of the Dual Mode System 
15 for Cell Based Screening in which one platform uses a telescope lens to read all wells 
of a microtiter plate and a second platform that uses a higher magnification lens to read 
individual cells in a well. 

Figure 7 is a detail of an optical system for a single platform architecture of the Dual 
Mode System for Cell Based Screening that uses a moveable ^telescope' lens to read all 
20 wells of a microtiter plate and a moveable Higher magnification lens to read individual 
cells in a well. 

Figure 8 is an illustration of the fluid delivery system for acquiring kinetic data on the 
Cell Based Screening System. 

Figure 9 is a flow chart of processing step for the cell-based scanning system. 
25 Figure 1 0 A- J illustrates the strategy of the Nuclear Translocation Assay. 

Figure 11 is a flow chart defining the processing steps in the Dual Mode System for 
Cell Based Screening combining high throughput and high content screening of 
microtiter plates. 

Figure 12 is a flow chart defining the processing steps in the High Throughput mode of 
30 the System for Cell Based Screening, 

Figure 13 is a flow chart defining the processing steps in the High Content mode of the 
System for Cell Based Screening. 
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Figure 14 is a flow chart defining the processing steps required for acquiring kinetic 
data in the High Content mode of the System for Cell Based Screening. 
Figure 15 is a flow chart defining the processing steps performed within a well during 
the acquisition of kinetic data. 

Figure 16 is an example of data fi*om a known inhibitor of translocation. 
Figure 17 is an exarhple of data from a known stimulator of translocation. 
Figure 18 illustrates data presentation on a graphical display. 

Figure 19 is an illustration of the data fi-om the High Throughput mode of flie System 
for Cell Based Screening, an example of the data passed to the High Content mode, the 
data acquired in the high content mode, and the results of the analysis of that data. 
Figure 20 shows the measurement of a dnig-mduced cytoplasm to nuclear 
translocation. 

Figure 21 illustrates a graphical user interface of the measurement shown in Figure 20. 
Figure 22 illustrates a graphical - user interface, with data, presentation, of the 
measurement shown in Fig. 20. 

Figure 23 is a graph representing the kinetic data obtained fi-om the measurements 
depicted in Fig. 20. ' 

Figure 24 details a high-content screen of drug-induced apoptosis. 
Figure 25. Graphs depicting changes in morphology upon induction of apoptosis. 
Staurosporine (A) and paclitaxel (B) induce classic nuclear fi-agmentation in L929 cells. 
BHK cells exhibit concentration dependent changes in response to staurosporine (C), 
but a more classical response to pachtaxel (D). MCF-7 cells exhibit either nuclear 
condensation (E) or fi-agmentation (F) in response to staurosporine and paclitaxel, 
respectively. In all cases, cells were exposed to the compounds for 30 hours. 
Figure 26 illustrates the dose response of cells to staurosporine in temis of both nuclear 
size and nuclear perimeter convolution. 

Figure 27, Graphs depicting induction of apoptosis by staurosporine and paclitaxel 
leading to changes in peri-nuclear f-actin content. (A, B) Both apoptotic stimulators 
induce dose-dependent increases in f-actin content in L929 cells. (C) In BHK cells, 
staurosporine induces a dose-dependent increase in f-actin, whereas paclitaxel (D) 
produces results that are more variable. (E) MCF-7 cells exhibit either a decrease or 
increase depending on the concentration of staurosporine. (F) Paclitaxel induced 
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changes in f-actin content were highly variable and not significant. Cells were exposed 
to the compounds for 3 0 hours. 

Figure 28. Graphs depicting mitochondrial changes in response to induction of 
apoptosis. L929 (A3) and BHK (C,D) cells responded to both staurosporine (A,G) and 
paclitaxel (B,D) with mcreases in mitochondrial mass. MCF^7 cells exhibit either a 
decrease in membrane potential (E, staurosporine) or an increase in mitochondrial mass 
(F, paclitaxel) depending on the stimulus. Cells were exposed to the compounds for 30 
hourSi 28G is a graph showing the simultaneous measurement of staurosporine effects 
on mitochondrial mass and mitochondrial potential in BHK cells. 

Figure 29 shows the nucleic acid and amino acid sequence for various types of 
protesae biosensor domains. (A) Signal sequences. (B) Protease recognition sites. (C) 
Product/Reactaiit target sequences 

Figure 30 shows schematically shows some basic organization of domains in the 
protease biosensors of the invention. 

Figure 31 is a schematic diagram of a specific 3-domain protease biosensor. 
Figure 32 is a photograph showing the effect of stimulation of apoptosis by cis-platin 
on BHK cells transfected with an expression vector that expresses the caspase 
biosensor shown in Figure 32. 

Figure 33 is a schematic diagram of a specific 4-domain protease biosensor. 

Figure 34 is a schematic diagram of a specific 4-domain protease biosensor, containing 

a nucleolar localization signal. 

Figure 35 is a schematic diagram of a specific 5-domain protease biosensor. 
Figure 36 shows the differential response in a dual labeling assay of the p38 MAPK 
and NF-kB pathways across three model toxins and two different cell types. 
Treatments marked with an asterisk are different firom controls at a 99% confidence 
level (p < 0.0 1). 

DETAILED DESCRIPTION OF THE INVENTION 

All cited patents, patent applications and other references are hereby 
incorporated by reference in their entirety^^^^^ 

As used herein, the following terms have the specified meamng: 
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Markers of cellular domains. Luminescent probes that have high affinity for 
specific cellular constituents including specific organelles or molecules. These probes 
can either be small luminescent molecules or fluorescently tagged macromolecules 
used as "labeling reagents", "environmental indicators", or "biosensors." 

Labeling reagents. Labeling reagents include, but are not limited to, 
luminescently labeled macromolecules including fluorescent protein analogs and 
biosensors, luminescent macromolecular chimeras including those formed with the 
green fluorescent protein and mutants thereof, luminescently labeled primary or 
secondary antibodies that react with cellular antigens involved in a physiological 
response, luminescent stains, dyes, and other small molecules. 

Markers of cellular translocations. Luminescently tagged macromolecules or 
organelles that move from one cell domain to another during some cellular process or 
physiological response. Translocation markers can either simply report location 
relative to the markers of cellular domains or they can also be "biosensors" that report 
some biochemical or molecular activity as well. 

Biosensors. Macromolecules consisting of a biological functional domain and a 
luminescent probe or probes that report the environmental changes that occur either 
internally or on their surface. A class of luminescently labeled macromolecules 
designed to sense and report these changes have been termed "fluorescent-protein 
biosensors". The protein component of the biosensor provides a highly evolved 
molecular recognition moiety. A fluorescent molecule attached to the protein 
component in the proximity of an active site transduces environmental changes into 
fluorescence signals that are detected using a system with an appropriate temporal and 
spatial resolution such as the cell scanning system of the present invention. Because 
the modulation of native protein activity within the living cell is reversible, and because 
fluorescent-protein biosensors can be designed to sense reversible changes in protein 
activity, these biosensors are essentially reusable. 

Disease associated sequences ("DAS"). This term refers to nucleic acid 
sequences identified by standard techniques, such as primary DNA sequence data, 
genomic methods such as subtraction hybridization and RADE, and proteomic methods 
in combination with reverse genetics, as being of drug candidate compoxmds. The term 
does not mean that the sequence is only associated with a disease state. 
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High content screening (HCS) can be used to measure the effects of drugs on 
complex molecular events such as signal transduction pathways, as well as cell 
functions including, but not limited to, apoptosis, cell division, cell adhesion, 
locomotion, exocytosis, and cell-cell communication. Multicolor fluorescence permits 
5 multiple targets and cell processes to be assayed in a single screen. Cross-correlation 
of cellular responses will yield a wealth of information required for target validation 
and lead optimization. 

In one aspect of the present invention, a cell screening system is provided 
comprising a high magnification fluorescence optical system having a microscope 

10 objective, an XY stage adapted for holding a plate with an array of locations for 
holding cells and having a means for moving the plate to align the locations with the 
microscope objective and a means for moving the plate in the direction to effect 
focusing; a digital camera; a light source having optical means for directing excitation 
light to cells in the array of locations and a means for directing fluorescent light emitted 

15 fi-om the cells to the digital camera; and a computer means for receiving and processing 
digital data from the digital camera wherein the computer means includes: a digital 
frame grabber for receiving the images from the camera, a display for user interaction 
and display of assay results, digital storage media for data storage and archiving, and 
means for control, acquisition, processing and display of results. 

20 Figure 1 is a schematic diagram of a preferred embodiment of the cell scanning 

system. An inverted fluorescence microscope is used i, such as a Zeiss Axiovert 
inverted fluorescence microscope which uses standard objectives with magnification of 
1-1 OOx to the_ camera, and a white light source (e.g. lOOW mercury-arc lamp or 75 W 
xenon lamp) with power supply 2. There is an XY stage 3 to move the plate 4 in the 

25 XY direction over the microscope objective, A Z-axis focus drive 5 moves the 
objective in the Z direction for focusing. A joystick 6 provides for manual movement 
of the stage in the XYZ direction. A high resolution digital camera 7 acquires images 
from each well or location on the plate. There is a camera power supply 8^ an 
automation controller 9 and a central processing unit JO. The PC 11 provides a display 

30 12 and has associated software. The printer 13. provides for printing of a hard copy 
record. 
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Figure 2 is a schematic of one embodiment of the microscope assembly 1 of the 
invention, showing in more detail the XY stage 3, Z-axis focus drive 5, joystick 6, light 
source 2, and automation controller 9. Cables to the computer VS and microscope 16, 
respectively, are provided. In addition. Figure 2 shows a 96 well microtiter plate 17 
5 which is moved on the XY stage 3 in the XY direction. Light from the light source 2 
passes through the PC controlled shutter 18 to a motorized filter wheel 19 with 
excitation filters 20. The light passes into filter cube 25 which has a dichroic mirror 26 
and an emission filter 27. Excitation light reflects off the dichroic mirror to the wells in 
the microtiter plate 17 and fluorescent light 28 passes through the dichroic mirror 26 

10 and the emission filter 27 and to the digital camera 7. 

Figure 3 shows a schematic drawing of a preferred camera assembly. The 
digital camera 7, which contains an automatic shutter for exposure control and a power 
supply 31, receives fluorescent light 28 from the microscope assembly. A digital cable 
30 transports digital signals to the computer. 

15 The standard optical configurations described above use microscope optics to 

directly produce an enlarged image of the specimen on the camera sensor in order to 
capture a high resolution image of the specimen. This optical system is commonly 
referred to as 'wide field' microscopy. Those skilled in the art of microscopy will 
recognize that a high resolution image of the specimen can be created by a variety of 

20 other optical systems, including, but not limited to, standard scanning confocal 
detection of a focused point or line of illumination scanned over the specimen (Go et al. 
1997, supra), and multi-photon scanning confocal microscopy (Denk et al,, 1990, 
supra), both of which can form images on a CCD detector or by synchronous 
digitization of the analog output of a photomultiplier tube. 

25 In screening applications, it is often necessary to use a particular cell line, or 

primary cell culture, to take advantage of particular features of those cells. Those 
skilled in the art of cell culture will recognize that some cell lines are contact inhibited, 
meaning that they will stop growing when they become surroimded by other cells, 
while other cell lines will continue to grow under those conditions and the cells will 

30 literally pile up, forming many layers. An example of such a cell line is the HEK 293 
(ATCC CRL-1573) line. An optical system that can acquire images of single cell 
layers in multilayer preparations is required for use with cell lines that tend to form 
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layers. r The large, depth of field of wide field micrpscopjes pro^^^^ image that is a 
. projection thr^ough the many layers, o making analysis subcellular spatial 

distributipn? ex^ layerrfonning cells. Alternatiyely, the shalloAv 

depth : off field . that pan be achieyed on a confqcalt microscope, (a^^ micron), 
5 allows discrimination of a single cell layer at high resolution, simplifying^ ^ 
determination of AenSubceUul^^^^^^ imaging is 

preferable when defteption modes such as flypresqence lifetmie imaging are required, 

The output of a standard confbcal imaging attachment for a microscope is a 
digital image that can be cpnyerted to the same fomiat as Jhe in[iages produced by the 

10 other cell screening system embodiments describei^^^^ can therefore, be 

processed in exactly the, sanie way as those images, TTie pveraU control, acquisition 
and analysis in this embodiment is essentially the same. The optical configuration of 
the confocal microscope system, is essentially the same as that described above, except 
for the illuminator and detectors. Illumination and detection systems required for 

15 confocal microscopy have been designed as accessories to be attached to standard 
microscope optical systems such as that of the present iiivention (Zeiss, Germany). 
These altemative optical systems therefore can be easily integrated into the system as 
described above. 

Figure 4 illustrates an altemative embodiment of the invention in which cell 

20 arrays are in microwells 40 on a microplate 41, described ion co-pending U.S. 
Application S/N 08/865,341, incorporated by reference herein in its entirety. Typically 
the microplate is 20 mm by 30 nmi as compared to a standard 96 well microtiter plate 
which is 86 mm by 129 mm. The higher density array of cells on a microplate allows 
the microplate to be imaged at a low resolution of a few microns per pixel for high 

25 throughput and particular locations on the microplate to be imaged at a higher 
resolution of less than 0.5 microns per pixel. These two resolution modes help to 
improve the overall throughput of the system, - 

The microplate chamber 42 serves as a microfluidic delivery system for the 
addition of compounds to cells. The microplate 41 in the microplate chamber 42 is 

30 placed in an XY microplate reader 43. Digital data is processed as described above. 
The small size of this microplate system increases throughput, minimizes reagent 
volume and allows control of the distribution and placement of cells for fast and precise 
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cell-based analysis. Processed data can be displayed on a PC screen U. and made part 
of a bioinfonnatics data base 44. This data base not only permits storage and retrieval 
of data obtained through the methods of this invention, but also permits acquisition and 
storage of extemal data relating to cells. Figure 5 is a PC display which illustrates the 

5 operation of the software. 

In an alternative embodiment, a high throughput system (HTS) is directly 
coupled with the HCS either on the same platform or on two separate platforms 
connected electronically (e.g. via a local area network). This embodiment of the 
invention, referred to as a dual mode optical system, has the advantage of increasing the 

10 throughput of a HCS by coupling it with a HTS and thereby requiring slower high 
resolution data acquisition and analysis only on the small subset of wells that show a 
response in the coupled HTS. 

High throughput 'whole plate' reader systems are well known in the art and are 
commonly used as a component of an HTS system used to screen large numbers of 

15 compounds (Beggs (1997), J. ofBiomoIea Screening 2:71-78; Macaffrey et al., (1996) 
J. Biomolec. Screening 1:187-190). 

In one embodiment of dual mode cell based screening, a two platform 
architecture in which high throughput acquisition occurs on one platform and high 
content acquisition occurs on a second platform is provided (Figure 6). Processing 

20 occurs on each platform independently, with results passed over a network interface, or 
a single controller is used to process the data from both platforms. 

As illustrated in Figure 6, an exemplified two platform dual mode optical 
system consists of two light optical instruments, a high throughput platform 60 and a 
high content platform 65* which read fluorescent signals emitted from cells cultured in 

25 microtiter plates or microwell arrays on a microplate, and communicate with each other 
via an electronic connection 64. The high throughput platform 60 analyzes all the wells 
in the whole plate either in parallel or rapid serial fashion. Those skilled in the art of 
screening will recognize that there are a many such commercially available high 
throughput reader systems that could be integrated into a dual mode cell based 

30 screening system (Topcount (Packard Instruments, Meriden, CT); Spectramax, 

Lumiskan (Molecular Devices, Sunnyvale, CA); Fluoroscan (Labsystems. Beverly, 

MA)). The high content platform 65, as described above, scans from well to well and 
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acquires and analyzes high resolution image data collected from individual cells within 
a well '^ 

The HTS software, residing on the system's computer 62, controls the high 
throughput itistrum^rit, and results are displayed on the monitbi- 6i. The HCS software, 
5 residing bri it*s computer system 67, coiitrbls the Wgh content instiounerit hardware 65, 
dptibhal devices (e.g. plate loader, enviroiiinferital chairibCT^ fluid dispenser), analyzes 
digital imaLge ^iata from the plate, dispilays results ori^ the monitor 66 and manages data 
measured in an integrated database. The two systems can also share a single compuleir, 
in which case all data wbiild be collected, processed and displayed on that computer^ 
10 without 4Me need for a local area network to transfer the data. Microtiter plates are 
transferred fi'6m'%e 'W the high content systein 63 eitiier 

niahually or by "a robotic plate transfer device, as is well known in the art (Beggs 
(1997), irwpra;^Mcaffre^ 

In a preferred embodiment, the dual mode optical system utilizes a single 
15 platform system (Figure 7). It consists of two separate optical modules, an HCS 
module 203 and an HTS module 209 that can be independently or collectively moved 
so that only one at a time is used to collect data from the microtiter plate 201 . The 
microtiter plate 201 is mounted in a motorized X,Y stage so it can be positioned for 
imaging in either HTS or HCS mode. After collecting and analyzing the HTS image 
20 data as described below, the HTS optical module 209 is moved out of the optical path 
and the HCS optical module 203 is moved into place. 

The optical module for HTS 209 consists of a projection lens 214. excitation 
wavelength filter 213 and dichroic mirror 210 which are used to illuminate the whole 
bottom of the plate with a specific wavelength band from a conventional microscope 
25 lamp system (not illustrated). The fluorescence emission is collected through the 
dichroic mirror 210 and emission wavelength filter 211 by a lens 212 vvhich forms an 
image on the camera 216 with sensor 215 , 

The optical module for HCS 203 consists of a projection lens 208. excitation 

wavelength filter 207 and dichroic-'m 204 which are used to illuminate the back 

30 aperture of the microscope objective 202. and thereby the field of that objective, from a 

standard microscope illumination system (not shown). The fluorescence emission is 
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collected by the microscope objective 202 , passes through the dichroic mirror 204 and 
emission wavelength filter 2Q5 and is focused by a tube lens 206 which forms an image 
on the same camera 216 with sensor 215. 

In an alternative embodiment of llie present invention, the cell screiening system 
fiirther comprises a fluid delivery device for use with the Ijive cell embodiment of the 
method of cell screening (see belpw); figure 8 exemplifies a fluid delivery dq>acejFor 
use with the system of ttie invention. It consists of a bank of 12„syMge p 701 
driven by a.single motor drive. Each syringe 702 is sized according to, the volume to be 
delivered to each well, t^ically betvveen 1 andlOO |iL. Eaph syringe is attached via 
flexible tubing 703 to a similar bank of connectors which accept sitandard pipette tips 
705 . The bank of pipette tips are attached . to a drive system so they can be lowered and 
raised relative to the microtiter plate 706 to deliver fluid to each well. The. plate is 
mounted on an X,Y stage, allowing movement relative to the optipal system 702 for 
data collection purposes. This set-up allows one set of pipette tips, or even a single 
pipette tip, to deliver reagent to all the wells on the plate. The bank of syringe pumps 
can be used to deliver fluid to 12 wells simultaneously, or to fewer wells by removing 
some of the tips. 

In another aspect, the present invention provides a method for analj^ing cells 
comprising providing an array of locations which contain multiple cells wherein the 
cells contain one or more fluorescent reporter molecules; scanning multiple cells in 
each of the locations containing cells to obtain fluorescent signals from the fluorescent 
reporter molecule in the cells; converting the fluorescent signals into digital data; and 
utilizing the digital data to detemime the distribution, environment or activity of the 
fluorescent reporter molecule within the cells. 

Cell Arrays 

Screening large numbers of compounds for activity with respect to a partictiiar 
biological function requires preparing arrays of cells for parallel handling of cells aind 
reagents. Standard 96 well micrptifer plates which are 86 mm by 129 nmi, with 6mm 
diameter wells on a 9mm pitch, are used for compatibility with current automated 
loading and robotic handling systems. The micrpplate is typically 20 mm by 30 mm, 
with cell locations that are 100-200 microns in dimension on a pitch of about 500 
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microns. Methods for making microplates ,arQ described in/U.Sl. Patent Application 
Serial No. 08/865,341, incorporated: by reference herein in its entirefty. , Micropliites 
may 1 consist of coplanar layers of materials to which cells adhere, patterned with 
materials to which cells will not adhere, or etched S.-dimensibnal surfaces of similarly 
5 pattered materials. For the purpose of the foUowing disc^^^^^ terms *weir and 

'micro weir refer to a location in an array of any construction to which ceUs adhere and 
within which the cells are irndged. Micrqplates inay also include fluid delivery 
chaimels in the spaces between;the wells. : The smaller format of a microplate increases 
the overall efficiency of the system by minimiziiig the quantities of the reagents, 

10 storage and handling during preparation and the overall movement required for the 
scarming ,operatiorx. In additiori, the whole area of the microplate can be irriaged more 
efficiently, allowing a second mode of operation for the microplate reader as described 
later in this document. r 
Fluorescence Reporter Molecules 

15 A major component of the new dmg discoveiy paradigm is a continually 

growing family of fluorescent and luminescent reagents that are used to measurei the 
teniporal and spatial distribution, content, and activity of intracellular, ions, metabolites, 
macromolecuies, and organelles. Glasses of these ^reagents include labeling reagents 
that measure the distribution and amount of molecules in living and fixed cells, 

20 environmental indicators to report signal transduction events in time and space, and 
fluorescent protein biosensors to measure target molecular activities within living cells. 
A niultiparameter approach that combines several reagents in a single cell is a powerfiil 
new tool for drug discovery. h 
The inethpd of the present invention is based on the high affinity of fluorescent 

25 or luminescent molecules for spe;cific cellular components. The affinity for specific 
coinponents is governed by physical forces such as ionic interactions, covalent bondirig 
(which includes chimeric fusion with protein-based chromophores, fluorophores, and 
luniiphqres), as well as hydrophobic interactions electrical potential, and, in some 
cases, simple entrapment within a cellular component. The luminescent probes can be 

30 small molecules, labeted macromolecuies, or genetically eiigineered proteins, 
including, but not limited to green fluorescent protein chimeras. 
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Those skilled in this aft will recognize a wide variety of fluorescent reporter 
molecules that can be used in the present invention, including, but not lunited to, 
fluoresdently labeled bidmolecules sueh as proteins,; phospholipids and DNA 
hybridising probed: Similarly, fluorescent reagents specifically synthesized with 
5 particular chemical properties of binding or association have been used as fluorescent 
reporter molecules (Barak et aLr(1997). J. Biol Chem. 272:27497-27500; Southwick et 
al., (^990), Cytometry 11:418^30; Tsieri (1989) in Methods in Cell Biologyi Vol 29 
Taylor and Wang (eds-), pi3. 127-156). Fluorescently labeled antibodies are particularly 
useful reporter molecules due to their high degree of specificity for attaching to a single 

10 mbleculkr target in a mixture of molecules as complex as a cell or tissue. ' 

The luminescent probes can be synthesized within the living cell or can be 
transported into the cell via several hbn-mechanical modes including diffusion, 
facilitated or active transport, signal-sequence-raediated transport, and endocytotic or 
pinocytotic uptake. Mechanical bulk loading methods, which are well known in the art, 

15 can also be used to load liuhiniesscent probes into living cells (Barber et al. (1996), 
Neurdscience^Zetters 207:17-20; Bright et aL (1996), Cytometry 24:226^33; McNeil 
(^9S9) WMethods iri Cell Biology, Vol 29, Taylor and Wang (eds.), pp. 1 53-173), 
These methods include electroporation and other mechanical methods such as scrape- 
loading, bead-loading, impact-loading, syringe-loading, hypertbnic and hypotonic 

20 loading. Additionally^ cells can be genetically engineered to j express reporter 
molecules, such as GFPv coupled to a protein of interest as previously described 
(Chalfie and Prasher U.S. Patent Nor 5,491,084; Cubitt et al. (1995), Trends in 
Biochemical Science 20:448-455). 

Once in the cell, the luminescent profcies accumulate at their target domain as a 

25 result of specific and high affinity interactions with the target domain or other modes of 

molecular targeting such as sighal-sequence-niediated transport. Fluorescently labeled 

reporter molecules are usefiil for deterininihg the location, ^ ^ and chemical 

environment of the reporter. For example, whether the reporter is in a lipophilic 

membrane enviroimient or in a more aqueous environment can be determined (Giuliaho 

30 et si. (1^9^, Am of Biophysics and Biom^^ Giuliano 

and Taylor (1995), Methods in Neuroscience 27 :l't6). The pH environment of the 

reporter can be determined (Bright et al. (1989), J. Cell Biology 104:1019-1033; 
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Giuliano et al. (1987), Anal Biochem. 167:362-371; Thomas et al. (1979), 
Biochemistry 18:2210-2218). It can be detennined whether a reporter having a 
chelating group is bound to an ion, such as Ca-H-, or not (Bright et al. (1989), In 
Methods in Cell Biology, Vol. 30, Taylor and Wang (eds.), pp. 157-192; Shimoura et aL 
(1988), 1 of Biochemistry (Tokyo) 251:405-410; Tsien (1989) In Methods in Cell 
Biology, Vol. 30, Taylor and Wang (eds.), pp. 127-156). 

Furthermore, certain cell types within an organism may contain components 
that can be specifically labeled that may not occur in other cell types. For example, 
epithelial cells often contain polarized membrane components. That is, these cells 
asymmetrically distribute macromolecules along their plasma membrane. Connective 
or supporting tissue cells often contain granules in which are trapped molecules specific 
to that cell type (e.g., heparin, histamine, serotonin, etc.). Most muscular tissue cells 
contain a sarcoplasmic reticulum, a specialized organelle whose fimction is to regulate 
the concentration of calcium ions within the cell cytoplasm. Many nervous tissue cells 
contain secretory granules and vesicles in which are trapped neurohormones or 
neurotransmitters. Therefore, fluorescent molecules can be designed to label not only 
specific components within specific cells, but also specific cells within a population of 
mixed cell types. 

Those skilled in the art will recognize a wide variety of ways to measure 
fluorescence. For example, some fluorescent reporter molecules exhibit a change in 
excitation or emission spectra, some exhibit resonance energy transfer where one 
fluorescent reporter loses fluorescence, while a second gains in fluorescence, some 
exhibit a loss (quenching) or appearance of fluorescence, while some report rotational 
movements (Giuliano et al. (1995), Ann, Rev. of Biophysics and Biomol Structure 
24:405-434; Giuliano et al. (1995), Methods in Neuroscience 27:1-16). 
Scanning cell arrays 

Referring to Figure 9, a preferred embodunent is provided to analyze cells that 

comprises operator-directed parameters being selected based on the assay being 

conducted, data acquisition by the cell screening system on the distribution of 

fluorescent signals within a sample, and interactive data review and analysis. At the 

start of an automated scan the operator enters information 100 that describes the 

sample, specifies the filter settings and fluorescent channels to match the biological 
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labels being used and the information sought, and then adjusts the camera settings to 
match the sample brightness. For flexibility to handle a range of samples, the software 
allows selection of various parameter settings used to identify nuclei and cytoplasm, 
and selection of different fluorescent reagents, identification of cells of interest based 
5 on morphology or brightness, and cell numbers to be analyzed per well. These 
parameters are stored in the system's for easy retrieval for each automated run. The 
system's interactive cell identification mode simplifies the selection of morphological 
parameter limits such as the range of size, shape, and intensity of cells to be analyzed. 
The user specifies which wells of the plate the system will scan and how many fields or 
10 how many cells to analyze in each well. Depending on the setup mode selected by the 
user at step lOK the system either automatically pre-focuses the region of the plate to 
be scanned using an autofocus procedure to "find focus" of the plate 102 or the user 
interactively pre-focuses 103 the scanning region by selecting three "tag" points which 
define the rectangular area to be scanned. A least-squares fit "focal plane model*' is 

15 then calculated from these tag points to estimate the focus of each well during an 
automated scan. The focus of each well is estimated by interpolating fi-om the focal 
plane model during a scan. 

During an automa;ted scan, the software dynamically displays the scan status, 
inclucfing the number of cells analyzed, the current well being analyzed, images of each 

20 independent wavelength as they are acquired, and the result of the screen for each well 
as it is determined. The plate 4 (Figure 1) is scanned in a serpentine style as the 
software automatically moves the motorized microscope XY stage 3 firom well to well 
and field to field within each well of a 96-well plate. Those skilled in the programming 
art will recognize how to adapt software for scanning of other microplate formats such 

25 as 24, 48, and 384 well plates. The scan pattern of the entire plate as well as the scan 
pattern of fields within each well are programmed. The system adjusts sample focus 
with an autofocus procedure 104 (Figure 9) through the Z axis focus drive 5, controls 
filter selection via a motorized filter wheel 19, and acquires and analyzes images of up 
to four different colors ("chaimels" or "wavelengths"). 

30 The autofocus procedure is called at a user selected frequency, typically for the 

first field in each well and then once every 4 to 5 fields within each well. The autofocus 
procedure calculates the starting Z-axis point by interpolating from the pre-calculated 
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plane focal model. Starting a programmable distance above or below this set point, the 
procedure moves the mechanical Z-axis through a number of different positions, 
acquires an image at each position, and finds the maximum of a calculated focus score 
that estimates the contrast of each image. The Z position of the image with the 
5 maximum focus score determines the best focus for a particular field. Those skilled in 
the art will recognize this as a variant of automatic focusing methods as described in 
Harms et al. in Cytometry 5 (1984), 236-243, Groen et al. in Cytometry 6 (1985), 81-91, 
and Firestone et al. in Cytometry 12 (1991), 195-206. 

For image acquisition, the camera's exposure time is separately adjusted for 

1 0 each dye to ensure a high-quality image from each charmel. Software procedures can be 
called, at the user's option, to correct for registration shifts between wavelengths by 
accounting for linear (X and Y) shifts between wavelengths before making any fiirther 
measurements. The electronic shutter 18 is controlled so that sample photo-bleaching is 
kept to a minimum. Background shading and uneven illumination can be corrected by 

15 the software using methods known in the art (Bright et al. (1987), J. Cell BioL 
104:1019-1033). 

In one channel, images are acquired of a primary marker 105 (Figure 9) 
(typically cell nuclei counterstained with DAPI or PI fluorescent dyes) which are 
segmented ("identified") usin§ an adaptive thresholding procedure. The adaptive 

20 thresholding procedure 106 is used to dynamically select the threshold of an image for 
separating cells from the background. The staining of cells with fluorescent dyes can 
vary to an unknown degree across cells in a microtiter plate sample as well as within 
images of a field of cells within each well of a microtiter plate. This variation can occur 
as a result of sample preparation and/or the dynamic nature of cells. A global threshold 

25 is calculated for the complete image to separate the cells from background and accoimt 
for field to field variation. These global adaptive techniques are variants of those 
described in the art. (Kittler et al. in Computer Vision, Graphics, and Image 
Processing 30 (1985), 125-147, Ridler et al. in IEEE Trans. Systems, Man, and 
Cybernetics (1 978), 630-632.) 

30 An alternative adaptive thresholding method utilizes local region thresholding 

in contrast to global image thresholding. Image analysis of local regions leads to better 
overall segmentation since staining of cell nuclei (as well as other labeled components) 
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can vary across an image. Using this global/local procedure, a reduced resolution 
image (reduced in size by a factor of 2 to 4) is first globally segmented (using adaptive 
thresholding) to find regions of interest in the image. These regions then serve as 
guides to more fully analyze the same regions at fixll resolution. A more localized 
thrbshold is then calculated (again using adaptive thresholding) for each region of 
interest. ' ^ 

^ The output of the s(5gmentation procedure is a binary image wherein the objects 
are white and the background is black. This binary image, also called a mask in the art, 
is used to determine if the field contains objects 107. The mask is labeled with a blob 
labeling method whereby each object (or blob) has a unique number assigned to it 
Morphological features, such as area and shape, of the blobs are used to differentiate 
blobs likely to be cells from those that are considered artifacts. The liser pre-sets the 
morphological selection criteria by either typing in known cell morphological features 
ori by using the interactive training utility. If objects of interest are found in the field, 
images are acquired for all other active channels 108. otherwise the stage is advanced 
to the next field 109 in the current well. Each object of interest is located in the image 
for further analysis 1 10 . The software distermines if the object meets the criteria for a 
valid cell nucleus 1 11 bv meaisuring its morphological features (size and shape). For 
each valid cell, die XYZ stage location is recorded, a small image of the cell is stored, 
and features are measured 112 . 

The cell scanning method of the present invention can be used to perform many 
different assays on cellular samples by applying a number of analytical methods 
simultaneously to measure features at multiple wavelengths. An example of one such 
assay provides for the following measurements: 

1 . The total fluorescent intensity within the cell nucleus for colors 1-^ 

2. The area of the cell nucleus for color 1 (the primary marker) 

3. The shape of the cell nticleus for color 1 is described by three shape 
features: 

a) perimeter squared area 
by box irea ratio 
c) height width ratio 

4. The average fluorescent intensity within the cell nucleus for colors 1-4 (i.e. 
#1 divided by Wly " 

5. The total fluorescent intensity of a ring outside the nucleus (see Figure 10) 
that represents fluorescence of the cell's cytoplasm (cytoplasmic mask) for 
colors 2-4 
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The area of the cytoplasmic mask ; > , 

The average fluorescent intensity of the cytoplasmic mask for colors 2-4 
(i.e. #5 divided by #6) 

The ratio of the average fluorescent intensity of the cytoplasmic niask to 
average fluorescent intensity within the cell nucleus for colors 2-4 (i.e. #7 
dividedby #4) 

The difference of the average fluorescent intensity of the cytoplasmic mask 
and the average fluorescent intensity within the cell nucleus for colors 2-4 
(i.e. #7 minus #4) v 
The nimiber of fluorescent domains (also call spots, dots, or grains) within 
the cell nucleus for colors 2-4 

Features 1 through 4 are general features of the different cell screening assays 
of the invention. These steps are commonly used in a variety of image analysis 

15 applications and are well kftdwn in art (Russ (1992) The Imdge Processing Handbook, 
CRC Press Inc.; Gonizales et al. (1987), Digital Image Processing. Addisori- Wesley 
Publishing Co. pp. 391-448). Features 5-9 have been dbveloped specifically to provide 
measiiremerits of a cell's fluorescent molecules within the local cytoplasmic region of 
the cell and the translocation (i.e. moveinent) of fluorescent molecules from the 

20 cytoplasm to the nucleus. These featufeis (steps 5-9) are used for aniaiyzirig cells in 
microplates for the inhibition of nuclear translocation. Fdr Example, inhibitloh of 
nuclear translocation of transcription factors provides a liovel approach to screening 
intact cells (detailed examples of other types of screens will be provided below). 
specific method measures the amount of probe in the nuclear region (fea&re 4) versus 

25 the local cytoplaismic region (feature 7) of each cell. Quaritificatiori of the difference 
between these two sub-cellulair compartrriehts provides a measure of cytoplasm-nuclear 
translocation (feature 9). 

Featiire 10 describes a screbn lised for counting of DNA or RNA probes within 
the nuclear region in colors 2-4. For example, probes are commeircially' available for 

30 identifying chroiribsomd-specific DNA sequra^ (Life Techiibiogies, Gaithbrsburg, 
MD; Genosys, Woodlands, TX; Biotechnolbgiek, Inc:, Richmoiid/ CA; Bid 101, liib., 
Vista, CA) Cells are three-diihehsion^^ in natiwe and when examined at a high 
magnification under a microscope one probe maty be in-fociis while 'another may be 
completely 'Mf-6f-fbciiS!'The'be sereehirig^metlMi^^o^ 

35 for detecting threse-dimehsidriaJ probes in nuclei by acquiring imiges frbih multiple 

focal planes. The software moves the Z-axis motor drive 5 (Figure 1) in small steps 
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where the step distance is user selected to account for a wide range of different nuclear 
diameters. At each of the focal steps, an image is acquired. The maximum gray-level 
intensity from each pixel in each image is found and stored in a resulting maximum 
projection image. The maximum projection image is then used to count the probes. The 
5 above method works well in counting probes that are not stacked directly above or 
below another one. To account for probes stacked on top of each other in the Z- 
direction, users can select an option to analyze probes in each of the focal planes 
acquired. In this mode, the scanning system performs the maximum plane projection 
method as discussed above, detects probe regions of interest in this image, then further 

10 analyzes these regions in all the focal plane images. 

After measuring cell features 112 (Figure 9), the system checks if there are any 
unprocessed objects in the current field 113. If there are any unprocessed objects, it 
locates the next object 110 and determines whether it meets the criteria for a valid cell 
nucleus 11 K and measures its features. Once all the objects in the current field are 

15 processed, the system determines whether analysis of the current plate is complete 114 : 
if not, it detemiines the need to find more cells in the current well 115 . If the need 
exists, the system advances the XYZ stage to the next field within the current well 109 
or advances the stage to the next well 116 of the plate. 

After a plate scan Is complete, images and data can be reviewed with the 

20 system's image review, data review, and summary review facilities. All images, data, 
and settings from a scan are archived in the system's database for later review or for 
interfacing with a network information management system. Data can also be exported 
to other third-party statistical packages to tabulate results and generate other reports. 
Users can review the images alone of every cell analyzed by the system with an 

25 interactive image review procedure 117 . The user can review data on a cell-by-cell 
basis using a combination of interactive graphs, a data spreadsheet of measured 
features, and images of all the fluorescence channels of a cell of interest with the 
interactive cell-by-cell data review procedure 118 . Graphical plotting capabilities are 
provided in which data can be analyzed via interactive graphs such as histograms and 

30 scatter plots. Users can review summary data that are accumulated and summarized for 
all cells within each well of a plate with an interactive well-by-well data review 
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procedure 119 . Hard copies of graphs and images can be printed on a wide range of 
standard printers. 

As a final phase of a complete scan, reports can be generated on one or more 
statistics of the measured features. Users can generate a graphical report of data 
5 summarized on a well-by-well basis for the scanned region of the plate using an 
interactive report generation procedure 120 . This report includes a summary of the 
statistics by well in tabular and graphical format and identification information on the 
sample. The report window allows the operator to enter comments about the scan for 
later retrieval. Multiple reports can be generated on many statistics and be printed with 
10 the touch of one button. Reports can be previewed for placement and data before being 
printed. 

The above-recited embodiment of the method operates in a single high 
resolution mode referred to as the high content screening (HCS) mode. The HCS mode 
provides sufficient spatial resolution within a well (on the order of 1 ^m) to define the 
15 distribution of material within the well, as well as within individual cells in the well. 
The high degree of information content accessible in that mode, comes at the expense 
of speed and complexity of the required signal processing. 

In an alternative embodiment, a high throughput system (HTS) is directly 
coupled with the HCS either on the same platform or on two separate platforms 
20 connected electronically (e.g. via a local area network). This embodiment of the 
invention, referred to £is a dual mode optical system, has the advantage of increasing the 
throughput of an HCS by coupling it with an HTS and thereby requiring slower high 
resolution data acquisition and analysis only on the small subset of wells that show a 
response in the coupled HTS. 

25 High throughput 'whole plate' reader systems are well known in the art and are 

commonly used as a component of an HTS system used to screen large numbers of 
compounds (Beggs et al. (1997), supra\ McCaffrey et al. (1996), supra ). The HTS of 
the present invention is carried out on the microtiter plate or mictowell array by reading 
many or all wells in the plate simultaneously with sufficient resolution to make 

30 determinations on a well-by-well basis. That is, calculations are made by averaging the 
total signal output of many or all the cells or the bulk of the material in each well. 
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Wells that ekhibit some defined response in the HTS (the *hits') are ^fl^^ 

system. Then on the same microtiter plate or microwell array, each well identified as a 

hit is measured via HCS as described above. Thus, the dual mode process involves: 

1 . Rapidly measuring numerous wells of a microtiter plate or microwell array, 

5 2. Iiiterpreting the data to determine the overall activity of ftuorescently labeled 
reporter molecules in the cells on a well-by--well basis to identify "hits" (wells that 
exhibit a defined response), 

3 . Imaging numerous cells in each "hit" well, and 

4. Interpreting the digital iraa.ge data to determine the distribution, environment or 
1 0 activity of thfe fliibriscj^htly labeled r^brtffl- molecules iii the iridividiM cells (i.e. 

intracellular measurements) and the distribution of the cells to test for specific 
biological functions 

in d prefeired eir^ of dual mode processing (Figure 11), at the start of a 

15 nin 301; ^he^operatbr enters information 3^02 that describes the plate' and its contents, 
specifies the filfei^ settings md fluorescent channels to match the biological labelslDeirig 
used, the mfbrrhiadon 'sbu^t andlhe camerk settiiigs to match the sample brightriess. 
TThiese paian^e^ scored nl thei 'Vystem*s da^ for easy reWeval aFor ©abh 
automated run. The microtiter plate or fmcro is Ibaded into the cell screening 

20 system 303 either -manually or automatically by controlling a robotic loading device. 
An optional ^environmental chamber 304 is controlled byothe system to maintain the 
temperature^ humidity and CO2 levels in the air surrounding live cells in the microtiter 
plate orr microwell array. An optional fluid delivery device 305 (see Figure 8) is 
controlled by the system to dispense fluids into the wells during the iscan. 

25 High throughput processing 306 is first performed dri the rnicfotiter plate oi- 

microwell array by acquiring and analyzing the signal from each of the wells in the 
plate. The processing performed in high throughput naqde 307 is illustrated in Figure 12 
and described below. Wells that exhibit some selected intensity response in this high 
throughput mode ("hits") are identified by the - system. The system performs a 

30 conditional operation 308 that tests for hits. If hits are found, those specific hit wells are 
further analyzed in high coiltent (micro, level) mode 309. The processing pisrformed in 
high content mode. 312 is illustrated in Figure 13; The system then updates 3IQ the 
informatics database 31 1 with results of the measurements on the plate. If there are 
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more plates to be analyzed 313 the system loads the next plate 303: otherwise the 
analysis of the plates terminates 314 . 

The following discussion describes the high throughput mode illustrated in 
Figure 12, The preferred embodiment of the system, the single platform dual mode 
5 screening system, will be described. Those skilled in the art will recognize that 
operationally the dual platform system simply involves moving the plate between two 
optical systems rather than moving the optics. Once the system has been set up and the 
plate loaded, the system begins the HTS acquisition and analysis 401 . The HTS optical 
module is selected by controlling a motorized optical positioning device 402 on the 

10 dual mode system. In one fluorescence channel, data from a primary marker on the 
plate is acquired 403 and wells are isolated from the plate background using a masking 
procedure 404 . Images are also acquired in other fluorescence channels being used 405 . 
The region in each image corresponding to each well 406 is measured 407 . A feature 
calculated from the measurements for a particular well is compared with a predefined 

15 threshold or intensity response 408. and based on the result the well is either flagged as 
a "hit" 409 or not. The locations of the wells flagged as hits are recorded for 
subsequent high content mode processing. If there are wells remaining to be processed 
410 the program loops back 406 until all the wells have been processed 411 and the 
system exits highlhroughput mode. 

20 Following HTS analysis, the system starts the high content mode processing 

501 defined in Figure 13. The system selects the HCS optical module 502 by 
controlling the motorized positioning system. For each "hit" well identified in high 
throughput mode, the XY stage location of the well is retrieved from memory or disk 
and the stage is then moved to the selected stage location 503. The autofocus procedure 

25 504 is called for the first field in each hit well and then once every 5 to 8 fields within 
each well. In one channel, images are acquired of the primary marker 505 (typically 
cell nuclei counterstained with DAPI, Hoechst or PI fluorescent dye). The images are 
then segmented (separated into regions of nuclei and non-nuclei) using an adaptive 
thresholding procedure 506 . The output of the segmentation procedure is a binary mask 

30 wherein the objects are white and the background is black. This binary image, also 
called a mask in the art, is used to determine if the field contains objects 507 . The mask 
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is labeled with a blob labeling method whereby each object (or blob) has a unique 
number assigned to it. If objects are found in the field, images are acquired for all other 
active channels 508> otherwise the stage is advanced to the next field 514 in the current 
well, Each object is located in the image for further analysis 509 . Morphological 
5 features, such, as area and shape of the objects, are used to select objects likely to b^e 
cell nuclei 510> and discard (do no fiirther processing on) those that are considered 
artifacts. For each valid cell nucleus, the XYZ stage location is recorded, a small image 
of the cell is stored, and assay specific features are measured 51 L The system then 
performs multiple tests on the cells by applying several analytical methods to measure 

10 features at each of severaLwavelengths. After measuring the cell features, the systems 
checks if there are any unprocessed objects in the current field 512. If there are any 
unprocessed objects, it locates the next object 509 and determines whether it meets the 
criteria for a valid cell nucleus 510, and measures its features. After processing all the 
objects in the current field, the system deteremines whether it needs to find more cells 

15 or fields in the. current well 513 . If it needs to find more cells or fields in the current 
well it advances the 5CYZ stage to the next field within the v current well 515 . 
Qtherwise, the system checks whether it has any remaining hit Ayells to measure 51^. ; If 
so, it advances to the next hit well 503 and proceeds through another cycle of 
acquisition and analysis, otherwise the HCS mode is finished 5JL6. / 

20 In an alternative embodiment of the present invention, a . method^ of kinetic live 

cell screening is provided. The previously described embodiments of the invention are 
used to characterize the spatial distribution of cellular components at a specific point in 
time, the time, of chemical fixation. As such, these embodiments .have limited utility 
for implementing kinetic based screens, due to the sequential nature of the image 

25 acquisition, and the amount of time required to read all the wells on a plate. For 
example, since a plate can require 30-60 minutes to read through all the wells, only 
very slow kinetic processes can be measured by simply preparing a plate of live cells 
and then reading through all the wells more than once. Faster kinetic processes can be 
measured by taking multiple readings of each.welLbefore proceeding to the next .well, 

30 but the elapsed time between the first and last well would be too long, and fast kinetic 
processes would likely be complete before reaching the last well. 

32 



wo 00/50872 



PCT/USOO/04794 



The kinetic live cell extension of the invention enables the design and use of 
screens in which a biological process is characterized by its kinetics instead of, or in 
addition to, its spatial characteristics. In many cases, a response in live cells can be 
measured by adding a reagent to a specific well and making multiple measurements on 
5 that well with the appropriate timing. This dynamic live cell embodiment of the 
invention therefore includes apparatus for fluid delivery to individual wells of the 
system in order to deliver reagents to each well at a specific time in advance of reading 
the well. Tliis embodiment thereby allows kinetic measurements to be made with 
temporal resolution of seconds to minutes on each well of the plate. To improve the 
10 overall efficiency of the dynamic Hve cell system, the acquisition control program is 
modified to allow repetitive data collection from sub-regions of the plate, allowing the 
system to read other wells between the time points required for an individual well. 

Figure 8 describes an example of a fluid delivery device for use with the live 
cell embodiment of the invention and is described above. This set-up allows one set of 

15 pipette tips 705, or even a single pipette tip, to deliver reagent to all the wells on the 
plate. The bank of syringe pimips 701 can be used to deliver fluid to 12 wells 
simultaneously, or to fewer wells by removing some of the tips 705 . The temporal 
resolution of the system can therefore be adjusted, without ^sacrificing data collection 
efficiency, by changing the number of tips and the scan pattern as follow^. Typically, 

20 the data collection and analysis fi-om a single well takes about 5 seconds. Moving fi-om 
well to well and focusing in a well requires about 5 seconds, so the overall cycle time 
for a well is about 10 seconds. Therefore, if a single pipette tip is used to deliver fluid 
to a single well, and data is collected repetitively from that well, measurements can be 
made with about 5 seconds temporal resolution. If 6 pipette tips are used to deliver 

25 fluids to 6 wells simultaneously, and the system repetitively scans all 6 wells, each scan 
will require 60 seconds, thereby establishing the temporal resolution. For slower 
processes which only require data collection every 8 minutes, fluids can be delivered to 
one half of the plate, by moving the plate during the fluid delivery phase, and then 
repetitively scanning that half of the plate. Therefore, by adjusting the size of the sub- 

30 region being scanned on the plate, the temporal resolution can be adjusted without 

having to insert wait times between acquisitions. Because the system is continuously 

scanning and acquiring data, the overall time to collect a kinetic data set firom the plate 
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is then simply the time to perform a single scan of the plate, multiplied by the number 
of time points required. Typically, 1 time point before addition of compounds and 2 or 
3 time points following addition should be sufficient for screening purposes. 

Figure 14 shows the acquisition sequence used for kinetic analysis. The start of 
5 processing 801 is configuration of the system, much of which is identical to the 
standard HCS configuration. In addition, the operator must enter infomiation specific 
to the kinetic analysis being performed 802, such as the sub-region size, the number of 
time points required, and the required time increment. A sub-region is a group of wells 
that will be scanned repetitively in order to accumulate kinetic data. The size of the 

10 sub-region is adjusted so that the system can scan a whole sub-region once during a 
single time increment, thus minimizmg wait times. The optimuni sub-region size is 
calculated from the setup parameters, and adjusted if necessary by the operator. » The 
system then moves the plate to the first sub-region 803. and to the first well in that sub- 
region 1Q4 to acquire the prestinlulatibh (time ^ 0) time points. The acquisition 

15 sequences perfdimed in each well is exactly the same as that required for the specific 
HGS being run in Idnetic mode. Figure 15 details a flow chart for that processing. All 
of the steps between the lstart 9Qi and the return 902 zu-e identical to those described as 
sieps- 5Q4 " 5l4 in Fiieure 13, / ' 

After processing each well in a sub-region, the system checks to see if all the 
20 wells in the sub-region have been processed W6 (Figure 14), and cycles through ail the 
wells until the whole region has been processed. The system then moves the plate into 
position for fluid addition, and controls fluidic system delivery of fluids to tlie entire 
sub-region 807 . Tthis may require multiple additions for siiS-fegions which span 
several rows on the plate, with the system moving the plate on the X,Y stage between 
25 additions. Once the fluids have been added, the system moves to the first well in the 
sutj-region 808 to begin acquisition of tiriie points. The data is acquired from each well 
809 and as before the system cycles through all the wells in the sub-region 8 lb . After 
each pass through the sub-region, the system checks whether all the time points have 
been collecteH 811 ai^l if hot^ pauses 813 if necessary 812 to stay sjmchroiiized with the 
30 requested time increment. Otherwise, the system checks for additional sub-regions on 
the plate ^8 14 and either moves to the next sub-region 803 or finishes 8 15 . Thus, the 
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kinetic analysis mode comprises operator identification of sub-regions of the naicrotiter 
plate or microwells to be screened, based on the kinetic response to be investigated, 
with data acquisitions within a sub-region prior to data acquisition in subsequent sub- 
regions. 

5 Specific Screens 

. In another aspect of the present invention, cell screening methods and machine 
readable storage medium comprising a program containing a set of instructions for 
causing a cell screening system to execute procedures for defining the distribution and 
activity of specific cellular constituents and processes is provided. In a preferred 

10 embodiment, the cell screening system comprises a high magnification fluorescence 
optical system with a stage adapted for holding cells and a means for moving the stage, 
a digital camera, a light source for receiving and processing the digital data from the 
digital camera, and a computer means for receiving and processing the digital data from 
the digital camera. This aspect of the invention comprises programs that instruct the 

15 cell screening system to defme the distribution and activity of specific cellular 
constituents and processes, using the luminescent probes, the optical imaging system, 
and the pattern recognition software of the invention. Preferred embodiments of the 
machine readable storage medium comprise programs consisting of a set of instructions 
for causing a cell screening system to execute the procedures set forth in Figures 9, 1 1, 

20 12, 13, 14 or 15. Another preferred embodiment comprises a program consisting of a 
set of instructions for causing a cell screening system to execute procedures for 
detecting the distribution and activity of specific cellular constituents and processes. In 
most preferred embodiments, the cellular processes include, but are not limited to, 
nuclear translocation of a protein, cellular morphology, apoptosis, receptor 

25 intemalization, and protease-induced translocation of a protein. 

In a preferred embodiment, the cell screening methods are used to identify 
compounds that modify the various cellular processes. The cells can be contacted with 
a test compound, and the effect of the test compound on a particular cellular process 
can be analyzed. Alternatively, the cells can be contacted with a test compound and a 
30 known agent that modifies the particular cellular process, to determine whether the test 
compound can inhibit or enhance the effect of the known agent. Thus, the methods can 



BNSDOCID: cWO_00S0972A2_i_> 



wo 00/50872 



PCT/USOO/04794 



be used to identify test compounds that increase or decrease a particular cellular 
response, as well as to identify test compounds that affects the ability of other agents to 
increase or decrease a particular cellular response. 

In another preferred embodiment, the locations containing cells are analyzed 
5 using the above methods at low resolution in a high throughput mode, and only a subset 
of the locations containing cells are analyzed in a high content mode to obtain 
luminescent signals from the luminescently labeled reporter molecules in subcellular 
compartments of the cells being analyzed. 

The following examples are intended for purposes of illustration only and 
10 should not be construed to limit the scope of the invention, as defined in the claims 
appended hereto. 

The various chemical compounds, reagents, dyes, and antibodies that are 
referred to in the following Examples are commercially available fi-om such sources as 
Sigma Chemical (St. Louis, MO), Molecular Probes (Eugene, OR), Aldrich Chemical 
15 Company (Milwaukee, WI), Accurate Chemical Company (Westbury, NY), Jackson 
Lnmunolabs, and Clontech (Palo Alto, CA). 

Example J Cytoplasm to Nucleus Translocation Screening: ^ 

a. Transcription Factors 

Regulation of transcription of some genes involves activation of a transcription 
factor in the cytoplasm, resulting in that factor being transported into the nucleus where 
it can initiate transcription of a particular gene or genes. This change in transcription 
factor distribution is the basis of a screen for the cell-based screening system to detect 
compounds that inhibit or induce transcription of a particular gene or group of genes. 
A general description of the screen is given followed by a specific example. 

The distribution of the transcription factor is determined by labeling the nuclei 
with a DNA specific fluorophore like Hoechst 33423 and the transcription factor with a 
specific fluorescent antibody. After autofocusing on the Hoechst labeled nuclei, an 
image of the nuclei is acquired in the cell-based screening system and used to create a 
mask by one of several optional thresholding methods, as described supra. The 
morphological descriptors of the regions defined by the mask are compared with the 

36 



20 



25 



30 



3IMSDOCID: <WO_0050B72A2J_> 



wo 00/50872 PCT/USDp/04794 

user defined parameters and valid nuclear masks are identifieid and used withr the 
following method to extract transcription factor distributions. Each valid nuclei-; mask 
is eroded to define a slightly smaller nuclear region. The original nuclear mask is then 
dilated in two steps to define a ring shaped region around the nucleus, which represfmts 

5 a; cytpplasmic region, The average antibody fluorescence in each of these two regions 
is (ietermined, and the , difference between these averages is defined as the NucCyt 
Difference. Two examples of determining nuclear translocatipn are discussed below 
and illustrated in Figure IDA- J. Figure lOA illustrates an unstimulated cell with its 
nucleus 200 labeled with a blue fluorophore and a trmscriptipn factor in the cytoplasm 

10 201 labeled with a green fluorophore, Figui^ei lOBJllustrates the nuclear mask 222 
derived by the cell-based screening system.,. Figure IOC iljustrates the cytoplasm 
of the unstimulated cell imaged at a green wavelength, Figure lOD illustrates the 
nuclear mask 202 is eroded (reduced) once to define a nuclear sampling region 204 
with minimal cytoplasmic distribution. The nucleus boundary 202 is dilated (exipanded) 

15 several times to fomi a ring that is 2-3 pixels wide tiiat is used to define the 
cytoplasmic, sampling region 205 for thp same cell Figure lOEiurther illustrates a side 
yiesw which shows the nuclear sampling^ region 2Q4 and the cytoplasmic sampling 
region 205 . Using these two sampling regions, data on nuclear Iranslocation can be 
automatically analyzed by the cell-based screming system on a cdl by cell ^asis. 

20 Figure lOF-J illustrates the strategy for determining nuclear translocation in a 
stimulated cell. Figure lOF illustrates a stimulated cell with its nucleus 20^ labeled with 
a blue fluorophore and a transcription factor in the cytoplasm 207 labeled with a green 
fluorophore. The nuclear mask 208 in Figure lOG is derived by the cell based 
screening system. Figure l OH illustrates, the cytoplasm 202 of a stimulated cell imaged 

25 at a green wavelength. Figure 101 illustrates the nuclear sampling region 211 and 
cytpplasmic sampling region 212 of the stimulated cell. Figure lOJ further illustrates a 
side view which shows the nuclear sampling region 211 and the cytoplasmic sampling 
region 212 , 

A specific application of this method has been used tp validate this method . as a 
30 screen. A human cell line was plated in 96 well micrptiter plates. Some rows of wells 
were titrated with IL-1, a known inducer of the NF-KB transcription factor. The cells 
were then fixed and stained by standard methods with a fluorescein labeled antibody to 
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the transcription factor, and Hoechst 33423. The cell-based screening system was used 
to acquire and analyze images from this plate and the NucCyt Difiference was found to 
be strongly correlated with the amount of agonist added to the wells as illustrated in 
Figure 16. In a second experiment, an antagonist to the receptor for IL-1, IL-IRA was 
titrated in the presence of IL-la, progressively inhibiting the translocation induced by 
IL-1 a. The NucCyt Difference was found to strongly correlate with this inhibition of 
translocation, as illustrated in Figure 17. 

Additional experiments have shown that the NucCyt Difference, as well as the 
NucCyt ratio, gives consistent results over a wide range of cell densities and reagent 
concentrations, and can therefore be routinely used to screen compound libraries for 
specific nuclear translocation activity. Furthermore, the same method can be used with 
antibodies to other transcription factors, or GFP-transcription factor chimeras, or 
fluorescently labeled transcription factors introduced into living or fixed cells, to screen 
for effects on the regulation of transcription factor activity. 

Figure 18 is a representative display on a PC screen of data which was obtained 
in accordance with Example 1 . Graph I 180 plots the difference between the average 
antibody fluorescence in the nuclear sampling region and cytoplasmic sampling region, 
NucCyt Difference verses Well #. Graph 2 181 plots the average fluorescence of^the 
antibody in the nuclear sampling region, NPl average'^ versus the Well #. Graph 3 182 
plots the average antibody fluorescence in the cytoplasmic sampling region, LIPl 
average, versus Well #. The software permits displaying data from each cell. For 
example. Figure 18 shows a screen display 183. the nuclear image 184, and the 
fluorescent antibody image 185 for cell #26. 

NucCyt Difference referred to in graph 1 180 of Figure 18 is the difference 
between the average cytoplasmic probe (fluorescent reporter molecule) intensity and 
the average nuclear probe (fluorescent reporter molecule) intensity. NPl average 
referred to in graph 2 181 of Figure 18 is the average of cytoplasmic probe (fluorescent 
reporter molecule) intensity within the nuclear sampling region, LlPl average referred 
to in graph 3 182 of Figure 18 is the average probe (fluorescesnt reporter molecule) 
intensity within the cytoplasmic sampling region. 

It will be understood by one of skill in the art that this aspect of the invention 
can be performed using other transcription factors that translocate from the cytoplasm 
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to the nucleus upon activation. In another specific example, activation of the c-fos 
transcription factor was assessed by defining its spatial position within cells. Activated 
c-fos is found only within the nucleus, while inactivated c-fos resides within the 
cytoplasm. 

5 3T3 cells were plated at 5000-10000 cells per well in a Polyfiltronics 96-well 

plate. The cells were allowed to attach and grow overnight. The cells were rinsed 
twice with 100 jil serum-free medium, incubated for 24-30 hours in serum-free MEM 
culture medium, and then stimulated with platelet derived growth factor (PDGF-BB) 
(Sigma Chemical Co., St, Louis, MO) diluted directly into seram free medium at 

10 concentrations ranging from 1-50 ng/ml for an average time of 20 minutes. 

Following stimulation, cells were fixed for 20 minutes in 3.7% formaldehyde 
solution in IX Hanks buffered saline solution (HBSS). Aflier fixation, the cells were 
washed with HBSS to remove residual fixative, permeabilized for 90 seconds with 
0.5% Triton X-100 solution in HBSS, and washed twice with EBSS to remove residual 

15 detergent. The cells were then blocked for 15 minutes with a 0.1% solution of BSA in 
HBSS, and ftirther washed with HBSS prior to addition of diluted primary antibody 
solution. 

c-Fos rabbit polyclonal antibody (Calbiochem, PC05) was diluted 1:50 in 
HBSS, and 50 fil of the dilution was applied to each well.^ Cells were incubated in the 

20 presence of primary antibody for one hour at room temperatiu-e, and then incubated for 
one hour at room temperature in a light tight container with goat anti-rabbit secondary 
antibody conjugated to ALEXA™ 488 (Molecular Probes), diluted 1:500 from a 100 
^ig/ml stock in HBSS. Hoechst DNA dye (Molecular Probes) was then added at a 
1:1000 dilution of the manufacturer's stock solution (10 mg/ml). The cells were then 

25 washed with HBSS, and the plate was sealed prior to analysis with the cell screening 
system of the invention. The data from these experiments demonstrated that the 
methods of the invention could be used to measure transcriptional activation of c-fos by 
defining its spatial position within cells. 

One of skill in the art will recognize that while the following method is applied to 

30 detection of c-fos activation, it can be applied to the analysis of any transcription factor 
that translocates from the cytoplasm to the nucleus upon activation. Examples of such 
transcription factors include, but are not limited to fos and jim homologs, NF-KB 
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(nuclear factor kappa firom B cells), NFAT (nuclear factor of activated T-lymphocytes), 
and STATs (signaLtransducer and activator of transcription) factors (For example, see 
Strehlow, L, and Schindler, C. 1998. J. BioL C/iem; 273:28049^28056; Ghow, et al. 
1997 Science. 278:1638-1641; Ding et al. 1998 J: BioL Chem, 273:28897-28905; 
5 Baldwin^ 1996. Annu Rev Immunol 14:649-83; Kuo, C.T., and . Leiden. 1999. 
Annu Rev Immunol 17:149-87; Rao, et al. 1997: Annu Rev Immunol 15:707-47; 
Masuda,et al. 1998. - Ce// Signal 10:599-611; Hoey, T., and U. Schindler. 1998* Curr 
Opin Genet Dev: 8:582-7; Liu, et al. 1998. Gurr Opin Immunol 10:271-8,) 

Thus, in this aspect of the invention, indicator cells are.:treated ^yith, test 
10 compounds and the distribution of luminescently labeled transcriptipn factor is 
measured in space and time using a cell screening system, such as the one disclosed 
above. The luminescently labeled transcription factor may be expressed by or^added to 
the cells either before, together with, or after contacting the cells with a test compound, 
i V- For example,; the transcription factor may be expressed as a luminescently 
15 labeled protein chimera by transfected indicator cells. Alternatively, the luminescently 
labeled transcription factor may be expressed^ isolated; and bulk-loaded into the 
indicator cells as described above, or the transcription factor may be lummescently 
labeled after isolation. As a further altemativej the. transcription factor is expressed by 
the indicator ceil, which is subsequently contactediwith a luminescent label, such as an 
20 antibody, that detects the transcription factor. 

> In a^'fiirther aspect, kits are provided for analyzing transcription factor activation, 
comprising an antibody that specifically /recognizes , a transcription factor of interest, 
V and instructions for using the antibody for carrying out. the methods described above. 
In a preferred: embodiment, the transcription factor-specific antibody, or a secondary 
25 ^antibody that detects the transcription factor antibody, is luminescently labeled. In 
' furthisr preferred embodiments, the kit contains cells that express the transcription 
factor of interest, and/or the kit contains a compound that is known to modify aetdvatipn 
of the transcription factor of interest, including but, not limited to platelet derived 
growth factor (PDGF) and serum, which both modify fos activation; and interleukin 
1 (IL- 1 ) and tumor necrosis factor (TOT)i:W^^ both modify NF^KB activation; - ' ^ 

In another embodiment,; the; ^kit comprises a recombinant expression vector 
cbrhprising a nucleic acid encoding "a transcription factor* of interest that translocates 
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from the cytoplasm to the nucleus upon activation, and instructions for using the 
expression vector to identify compounds that modify transcription factor activation in a 
cell of interest. Alternatively, the kits contain a purified, luminescently labeled 
transcription factor. In a preferred embodiment, the transcription factor is expressed as 
5 a fusion protein with a luminescent protein, including but not limited to green 
fluorescent protein, luceriferase, or mutants or fragments thereof. In various preferred 
embodiments, the kit further contains cells that are transfected with the expression 
vector, an antibody or fragment that specifically bind to the transcription factor of 
interest, and/or a compound that is known to modify activation of the transcription 
10 factor of interest (as above). 

b. Protein Kinases 

The cytoplasm to nucleus screening methods can also be used to analyze the 
activation of any protein kinase that is present in an inactive state in the cytoplasm and 

15 is transported to the nucleus upon activation, or that phosphorylates a substrate that 
translocates from the cytoplasm to the nucleus upon phosphorylation. Examples of 
appropriate protein kinases include, but are not limited to extracellular signal-regulated 
protein kinases (ERKs), c-Jun amino-terminal kinases (INKs), Fos regulating protein 
kinases (FRKs), p38 mitogen activated protein kinase (p38MAPK), protein kinase A 

20 (PKA), and mitogen activated protein kinase kinases (MAPKKs). (For example, see 
Hall, et al. 1999. J Biol Chem. 274:376-83; Han, et al. 1995. Biochim. Biophys. Acta. 
1265:224-227; Jaaro et al. 1997. Proc, Natl Acad. ScL U.S,A. 94:3742-3747; Taylor, et 
al. 1994. y. BioL Chem, 269:308-318; Zhao, Q., and F. S. Lee. 1999. J Biol Chem. 
274:8355-8; Paolilloet al. 1999. J Biol Chem. 274:6546-52; Coso et al. 1995. CeU 

25 81:1137-1146; Tibbies, L.A., and J.R. Woodgett 1999. Cell Mol Life ScL 55:1230-54; 
Schaeffer, H.J., and M.J. Weber. 1999. Mol Cell BioL 19:2435-44.) 

Alternatively, protein kinase activity is assayed by monitoring translocation of a 
luminescently labeled protein kinase substrate from the cytoplasm to the nucleus after 
being phosphorylated by the protein kinase of interest. In this embodiment, the 

30 substrate is non-phosphorylated and cytoplasmic prior to phosphorylation, and is 
translocated to the nucleus upon phosphorylation by the protein kinase. There is no 
requirement that the protein kinase itself translocates from the cytoplasm to the nucleus 
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in this embodiment. Examples of such substrates (and the corresponding protein 
kinase) include, but are not limited to c-jun (JNK substrate); fos (FRK substrate), and 
p38 (p38 MAPK substrate). 

Thus, in these embodiments, indicator cells are treated with test compounds and 
5 the distribution of luminescently labeled protein kinase or protein kinase substrate is 
measured in space and time using a cell screening system, such as the one disclosed 
above. The luminescently labeled protein kinase or protein kinase substrate may be 
expressed by or added to the cells either before, together with, or after contacting the 
cells with a test compound. For example, the protein kinase or protein kinase substrate 

10 may be expressed as a luminescently labeled protein chimera by transfected indicator 
cells. Alternatively, the luminescently labeled protein kinase or protein kinase 
substrate may be expressed, isolated, and bulk-loaded into the indicator cells as 
described above, or the protein kinase or protein kinase substrate may be luminescently 
labeled after isolation. As a further alternative, the protein kinase or protein kinase 

15 substrate is expressed by the indicator cell, which is subsequently contacted with a 
luminescent label, such as a labeled antibody, that detects the protein kinase or protein 
kinase substrate. 

In a fiirther embodiment, protein kinase activity is assayed by monitoring the 
phosphorylation state (ie: phosphorylated or not phosphorylated) of a protein kinase 

20 substrate. In this embodiment, there is no requirement that either the protein kinase or 
the protein kinase substrate translocate fi-om the cytoplasm to the nucleus upon 
activation. In a preferred embodiment, phosphorylation state is monitored by 
contacting the cells with an antibody that binds only to the phosphorylated form of the 
protein kinase substrate of interest (For example, as disclosed in U.S. Patent No. 

25 5,599,681). 

In another preferred embodiment, a biosensor of phosphorylation is used. For 
example, a luminescently labeled protein or fragment thereof can be ftised to a protein 
that has been engineered to contain (a) a phosphorylation site that is recognized by a 
protein kinase of interest; and (b) a nuclear localization signal that is unmasked by the 
30 phosphorylation. Such a biosensor will thus be translocated to the nucleus upon 
phosphorylation, and its translocation can be used as a measure of protein kinase 
activation. 
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In another aspect, kits are provided for analyzing protein kinase activation, 
comprising a primary antibody that specifically binds to a protein kinase, a protein 
kinase substrate, or a phosphorylated form of the protein kinase substrate of interest and 
instructions for using the primary antibody to identify compounds that modify protein 
5 kinase activation in a cell of interest. In a preferred embodiment, the primary antibody, 
or a secondary antibody that detects the primary antibody, is luminescently labeled. In 
otlier preferred embodiments, the kit further comprises cells that express the protein 
kinase of interest, and/or a compound that is known to modify activation of the protein 
kinase of interest, including but not limited to dibutyryl cAMP (modifies PKA), 

10 forskolin (PKA), and anisomycin (p38MAPK). 

Alternatively, the kits comprise an expression vector encoding a protein kinase 
or a protein kinase substrate of interest that translocates from the cytoplasm to the 
nucleus upon activation and instructions for using the expression vector to identify 
compounds that modify protein kinase activation in a cell of interest. Alternatively, the 

15 kits contain a purified, luminescently labeled protein kinase or protein kinase substrate. 
In a preferred embodiment, the protein kinase or protein kinase substrate of interest is 
expressed as a fusion protein with a luminescent protein. In further preferred 
embodiments, the kit further comprises cells that are transfected with the expression 
vector, an antibody or fragment thereof that specifically binds to the protein kinase or 

20 protein kinase substrate of interest, and/or a compound that is known to modify 
activation of the protein kinase of interest, (as above) 

In another aspect, the present invention comprises a machine readable storage 
medium comprising a program containing a set of instractions for causing a cell 
screening system to execute the methods disclosed for analyzing transcription factor or 

25 protein kinase activation, wherein the cell screening system comprises an optical 
system with a stage adapted for holding a plate containing cells, a digital camera, a 
means for directing fluorescence or luminescence emitted from the cells to the digital 
camera, and a computer means for receiving and processing the digital data from the 
digital camera. 

30 
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Example 2 Automated Screen for Compounds that Modify Cellular Moiphology 

Changes in cell size are associated with a nitmber of cellular conditiohs, stich as 
hypertrophy, cell attachment and spreading^ differentiation, growth and division, 
necrotic and programmed cell deiathj cell motility, morphogenesis, tube formation, and 
colony formation. 

For example, cellular hypertrophy has been associated with a cascade of 
alterations in gene expression and can be characterized in cell culture by an alteraition in 
ceill size, that is clearly visible in adherent cells growing on a coverslip. 

Cell size can also be measured to deterriiirie the attaicHment aiad spreading of 
adherent cells. Cell spreading is the result of selective binding of cell sutface receptors 
to substrate ligands and subsequent activation of signaling pathways to the 
cytoskeleton. Cell attachment and spreading to substrate molecules is an important step 
for the metastasis of cancer cells, leukocyte activation during the inflammatory 
response, keratinocyte movement during wound healing, and endothelial cell 
movement during angiogenesis. Compounds thait afTect these surface receptors, 
signaling pathways, or the cytoskeleton will affect ciell spreading and can be screened 
by measiuing cell size. 

Total cellular area can be monitored by labeling the entire cell body or the cell 
cytoplasm using cj^oskeletal markers, "'cytbso lie volume markers, or cell surface 
markers, in conjunction with a DNA label. Examples of such labels (many available 
from Molecular Probes (Eugene, Oregon) and Sigma Chemical Co. (St. Louis, 
Missouri)) include the following: 
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CELL SIZE AND AREA MARKERS 

Cytoskeletal Markers 

• ALEXA^^ 488 phalloidin (Molecular Probes. Oregon) 

• Tubulin-green fluorescent protein chimeras 

• Cvtokeratin-green fluorescent protein chimeras 

• Antibodies to cvtoskeletal proteins 



Cytosolic Volume Markers . . 

• Green fluorescent proteins . — 

• Chloromethvlfluorescein diacetate (CMFDA) 

• Calcein green — 

• BCECF/AM ester ^ 

• Rhodamine dextrati 

Cell Surface Markers for Lipid. Protein, or Oligosaccharide 

• Dihexadecvl tetramethvlindocarbocyanine perchlorate (DiIC 16) lipid dyes ^ 

m Triethylammonium propyl dibutvlamino stvrvl pyridiniu m (FM 4-64. FM l-43> lipid dyes 

» MITOTRACKER"^^ Green FM ^ __ 

• Lectins to oligosaccarides such as fluorescein concanavalin A or wheat germ agglutinin 

• SYPRO^ Red non-specific protein markers 

• Antibodies to various surface proteins such as epidermal growth factor _ — , 

• Biotin labeling of surface proteins followed by fluorescent s treoavidin labeleing 

Protocols for cell staining with these various agents are well known to those 
skilled in the art. Cells are stained Uve or after fixation and the cell area can be 

5 measured. For example, live cells stained with DiIC16 have homogeneously labeled 
plasma membranes, and the projected cro^s-sectional area of the cell is uniformly 
discriminated from background by fluorescence intensity of the dye. Live cells stained 
with cytosolic stains such as CMFDA produce a fluorescence intensity that is 
proportional to cell thickness. Although cell labeling is dimmer in thin regions of the 

10 cell, total cell area can be discriminated from background. Fixed cells can be stained 
^ with cytoskeletal markers such as ALEXA™ 488 phalloidin that label polymerized 
actin. Phalloidin does not homogeneously stain the cytoplasm, but still permits 
discrimination of the total cell area from background. 

15 Cellular hypertrophy 

A screen to analyze cellular hypertrophy is implemented using the following 
strategy. Primary rat myocytes can be cultured in 96 well plates, treated with various 
compounds and then fixed and labeled with a fluorescent marker for the cell membrane 
or cytoplasm, or cytoskeleton, such as an antibody to a cell surface marker or a 
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fluorescent marker for the cytoskeleton like rhodamine-phalloidin, in combination with 
a DNA label like Hoechst. 

After focusing on the Hoechst labeled nuclei, two images are acquired, one of 
the Hoechst labeled nuclei and one of the fluorescent cytoplasm image. The nuclei are 
identified by thresholding to create a mask and then comparing the morphological 
descriptors of the mask with a set of user defined descriptor values. Each non-nucleus 
image (or "cytoplasmic image") is then processed separately. The original cytoplasm 
image can be thresholded, creating a cytoplasmic mask image. Local regions containing 
cells are defined around the nuclei. The limits of the cells in those regions are then 
defined by a local dynamic threshold operation on the same region in the fluorescent 
antibody image. A sequence of erosions and dilations is used to separate slightly 
touching cells and a second set of morphological descriptors is used to identify single 
cells. The area of the individual cells is tabulated in order to define the distribution of 
cell sizes for comparison with size data from normal and hypertrophic cells. 

Responses fi-ora entire 96-well plates (measured as average cytoplasmic 
area/cell) were analyzed by the above methods, and the results demonstrated that the 
assay will perform the same on a well-to-well, plate-to-plate, and day-to-day basis 
(below a 15% cov for maximum signal). The data showed very good correlation for 
each day, and that there was no variability due to well position in the plate. 

The following totals can be computed for the field. The aggregate whole 
nucleus area is the number of nonzero pixels in the nuclear mask. The average whole 
nucleus area is the aggregate whole nucleus area divided by the total nmnber of nuclei. 
For each cjrtoplasm image several values can be computed. These are the total 
cytoplasmic area, which is the count of nonzero pixels in the cytoplasmic mask. The 
aggregate cytoplasm intensity is the sum of the intensities of all pixels in the 
cytoplasmic mask. The cytoplasmic area per nucleus is the total cytoplasmic area 
divided by the total nucleus count. The cytoplasmic intensity per nucleus is the 
aggregate cytoplasm intensity divided by the total nucleus count. The average 
cytoplasm intensity is the aggregate cytoplasm intensity divided by the cytoplasm area. 
The cytoplasm nucleus ratio is the total cytoplasm area divided by the total nucleus 
area. 
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Additionally, one or more fluorescent antibodies to other cellular proteins, such 
as the major muscle proteins actin or myosin, can be included. Images of these 
additional labeled proteins can be acquired and stored with the above images, for later 
review, to identify anomalies in the distribution and morphology of these proteins in 
5 hypertrophic cells. This example of a multi-parametric screen allows for simultaneous 
analysis of cellular hypertrophy and changes in actin or myosin distribution. 

One of skill in the art will recognize that while the example analyzes myocyte 
hypertrophy, the methods can be applied to analyzing hypertrophy, or general 
morphological changes in any cell type. 

10 

Cell morphology assays for prostate carcinoma 

Cell spreading is a measure of the response of cell surface receptors to substrate 
attachment ligands. Spreading is proportional to the ligand concentration or to the 
concentration of compounds that reduce receptor-ligand function. One example of 

15 selective cell-substrate attachment is prostate carcinoma cell adhesion to the 
extracellular matrix protein collagen. Prostate carcinoma cells metastasize to bone via 
selective adhesion to collagen. 

Compounds that interfere with metastasis of prostate carcinoma cells were 
screened as follows. PC3 human prostate carcinoma cells were cultured in media with 

20 appropriate stimulants and are passaged to collagen coated 96 well plates, Ligand 
concentration can be varied or inhibitors of cell spreading can be added to the wells. 
Examples of compounds that can affect spreading are receptor antagonists such as 
integrin- or proteoglycan-blockihg antibodies, signaling inhibitors including 
phosphatidyl inositol-3 kinase inhibitors, and cytoskeletal inhibitors such as 

25 cytochalasin D. After two hours, cells were fixed and stained with ALEXA™ 488 
phalloidin (Molecular Probes) and Hoechst 33342 as per the protocol for cellular 
hypertrophy. The size of cells under these various conditions, as measiu-ed by 
cytoplasmic staining, can be distinguished above background levels. The number of 
cells per field is determined by measuring the number of nuclei stained with the 

30 Hoechst DNA dye. The area per cell is found by dividing the cytoplasmic area 
(phalloidin image) by the cell number (Hoechst image). The size of cells is 
proportional to the ligand-receptor function. Since the area is determined by ligand 
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concentration and by the resultant function of the cell, dnig efficacy, as well as drug 
potency, can be determined by this cell-based assay. Other me^uremehts caii be made 
as discussed above for cellular hypertrophy. ' 

The methods for analyzing cellule riiOTpholdgy can be used in a combined high 
throughput-high content screen. In diie example, the high throughput mode scans the 
whole well for mcrease in fluoresCOTt pha^ A threshold ii set abbve 

which both nuclei (Hbechst) aiid 'cells (phalloidin) are measured in a high content 
mode. In another exiample, ah environmental biosensor (exami>les include, but are not 
limited to, those biosensors that are sensitive to calcium and pH changes) is added to 
the cells, and the cells are contacted with a compoimd. The cells are scanned in a high 
throughput mode, and those wells that exceed a pre-detennined threshold for 
luminescerice of the biosensor are scanned in a high content mode. 

In a further aspedt, kits are provided for analyzing cellular morphology, 
comprising a luminescent compound tHat can be used to specifically iabel the cell 
cytoplasni, membrane, or cytbskeletoh (such as those de^bribed above), and 
instfiictioris for using the iuminescent compound to identify test stimuU that induce or 
inhibit changes in cellular morphology according to the above methods. In a preferred 
embodiment^ the Idt further compiises^^a marker fot cell hUclei. In k further 

preferred embodiment, the kit comprises at least one compound that is known to 
modify cellular morphology, including, but not limited to integrin- or proteojglycan- 
blocking antibodies^ signaling inhibitors including phosphatidyl in6sitbl-3 kinase 
inhibitors, arid cytoskeletal inhibitors such as cytochalasin D. 

In another aspect, the present invention coihpriseis k machine readable storage 
medium comprising a program containing a bet 'of instructions for causing a cell 
screening system to execute the disclosed methods for analyzing cellular morphology, 
wherein the cell screening system comprises an optical system with a stage adapted for 
holding a plate containing cells, a digital camera, a means for directing fluorescence or 
luniinescence emitted from the cells -to the digital caihera, and a bompiiter means for 
rebeivirig and processing the digital data from the digital camera. 
Example' 3 = -^^Dudl Mode High Throughput and High'€<mtent Screlen' 

The following example is a screen for activation of a G-protein coupled receptor 
(GPGk) as detected by the translobiation of the GPCR from the plasma menibrkrie to a 
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proximal nuclear location. This example illustrates how a high throughput screen can 
be coupled with a high-content screen in the dual mode System for Cell Based 
Screening. 

G-protein coupled receptors are a large class of 7 trans-membrane domain cell 
5 surface receptors. Ligands for these receptors stimulate a cascade of secondary signals 
in the cell, which may include, but are not limited to, Ca^ transients, cyclic AMP 
production, inositol triphosphate (IP3) production and phosphorylation. Each of these 
signals are rapid, occuring in a matter of seconds to minutes, but are also generic. For 
example, many different GPCRs produce a secondary Ca"*^ signal when activated. 
10 Stimulation of a GPCR also results in the transport of that GPCR from the cell surface 
membrane to an internal, proximal nuclear compartment. This internalization is a much 
more receptor-specific indicator of activation of a particular receptor than are the 
secondary signals described above. 

Figure 19 illustrates a dual mode screen for activation of a GPCR. Cells 

15 carrying a stable chimera of the GPCR with a blue fluorescent protein (BFP) would be 
loaded with the acetoxymethylester form of Fluo-3, a cell permeable calcium indicator 
(green fluorescence) that is trapped in living cells by the hydrolysis of the esters. They 
would then be deposited into the wells of a microtiter plate 601. The wells would then 
be treated with an array of test compounds using a fluid delivery system, and a short 

20 sequence of Fluo-3 images of the whole microtiter plate would be acquired and 
analyzed for wells exhibiting a calcium response (i.e., high throughput mode). The 
images would appear like the illustration of the microtiter plate 601 in Figure 19. A 
small number of wells, such as wells C4 and E9 in the illustration, would fluoresce 
more brightly due to the Ca^ released upon stimulation of the receptors. The locations 

25 of wells containing compounds that induced a response 602, would then be transferred 
to the HCS program and the optics switched for detailed cell by cell analysis of the blue 
fluorescence for evidence of GPCR translocation to the perinuclear region. The bottom 
of Figure 19 illustrates the two possible outcomes of the analysis of the high resolution 
cell data. The camera images a sub-region 604 of the well area 603. producing images 

30 of the fluorescent cells 605 . In well C4, the uniform distribution of the fluorescence in 
the cells indicates that the receptor has not internalized, implying that the Ca"*^ response 
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seen was the result.of the stimulation of some other signalling system in the cell. The 
cells in well E9 606 on the other hand, clearly indicate a concentration of the receptor 
in the perinuclear region clearly indicating the full activation of the receptor. Becaiuse 
only a few hit \yells have to be anal>^ed with high resolution, the overall throughput of 
5 the dual mode system can be quite high, comparable to the high throughput system 
alone. 

Example 4 Kinetic High Content Screen 

The following is an example of a screen to measure the kinetics of 
10 internalization of a receptor. As described above, the stimulation of a GPCR, results in 
the internalization of the receptor, with a time course of about 15 min. Simply 
detecting the endpoint as internalized or not, may not be sufficient for defining the 
potency of a compound as a GPCR agonist or . antagonist. However, 3 time points at 5 
min intervals would provide information not only about potency during the time course 

15 of measurement, but would al^ allow extrapolation of Ae data to much longer tithe 
periods. To pelfonh this assay, the stib-regioh would be defined as two rows, ^^e 
samphng uiterval as 5 minutes and the total nuniber of time points 3. The system 
Would then start by sciaunirig two rows, and then adding reajgent to the two rows, 
establishing the time=0 reference. After reagent addition, the system would again scan 

20 the two row sub-region acquiring the first time point data. Since this process would 
take about 250 seconds, including scanning bac^k: to the beginning of the sub-region, flie 
system would wait 50 seconds to begin acquisition of the second tinie point Two more 
cycles would produce the three tiirie points £uid the system would move on to the 
seconia 2 row sub-region. The final two 2-rbw sub-regions would be scanned to finish 

25 all the wells on the plate, resulting in fotu: time points for each well over the whole 
platd. Although the time points for the wells would be offset slightly relative to 
tinie=0, the spaciiig of the time points would be very close to the required 5 itiinuteis, 
and the actual acquisition tinies and results recorded with much greater precision than 
in k fixed-cell screen. ^ - 

30 
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Example 5 High-content screen of human glucocorticoid receptor translocation 

One class of HCS involves the drug-induced dynamic redistribution of 
intracellular constituents. The human glucocorticoid receptor (hGR), a single "sensor" 
in the complex environmental response machinery of the cell, binds steroid molecules 
5 that have diffused into the cell. The ligand-receptor complex translocates to the 
nucleus where transcriptional activation occurs (Htun et al,, Proc. NatL Acad. Sci. 
93:4845, 1996). 

In general, hormone receptors are excellent drug targets because their activity 
lies at the apex of key intracellular signaling pathways. Therefore, a high-content 
10 screen of hGR translocation has distinct advantage over in vitro ligand-receptor binding 
assays. The availability of up to two more channels of fluorescence in the cell 
screening system of the present invention permits the screen to contain two additional 
parameters in parallel, such as other receptors, other distinct targets or other cellular 
. processes. 

15 Plasmid construct, A eukaryotic expression plasmid containing a coding 

sequence for a green fluorescent protein - human glucocorticoid receptor (GFP-hGR) 
chimera was prepared using GFP mutants (Palm et al., NaL Struct, Biol. 4:361 (1997). 
The construct was used to transfect a human cervical carcinoma cell line (HeLa). 

Cell preparation and tr/insfection. HeLa cells (ATCC CCL-2) were trypsinized 

20 and plated using DMEM containing 5% charcoal/dextran-treated fetal bovine serum 
(FBS) (HyClone) and 1% penicillin-streptomycin (C-DMEM) 12-24 hours prior to 
transfection and incubated at 37°C and 5% CO2 . Transfections were performed by 
calcium phosphate co-precipitation (Graham and Van der Eb, Virology 52:456, 1973; 
Sambrook et al., (1989). Molecular Cloning: A Laboratory Manual, Second ed. Cold 

25 Spring Harbor Laboratory Press, Cold Spring Harbor, 1989) or with Lipofectamine (Life 
Technologies, Gaithersburg, MD). For the calcium phosphate transfections, the 
medium was replaced, prior to transfection, with DMEM containing 5% 
charcoal/dextran-treated FBS. Cells were incubated with the calcium phosphate-DNA 
precipitate for 4-5 hours at 37°C and 5% CO2, washed 3-4 times with DMEM to 

30 remove the precipitate, followed by the addition of C-DMEM. 

Lipofectamine transfections were performed in serum-free DMEM without 
antibiotics according to the manufacturer's instructions (Life Technologies, 
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Gaithersburg, MD). Following a 2-3 hour incubation with the DNA-liposome 
complexes, the medium was removed and replaced with C-DMEM. All transfected 
cells in 96- well microtiter plates were incubated at 33°C and 5% CO2 for 24-48 hours 
prior to drug treatment. Experiments were performed with the receptor expressed 
5 transiently in HeLa cells. 

Dexameihasone induction of GFP-hGR translocation. To obtain receptor- 
ligand translocation kinetic data, nuclei of transfected cells were first labeled with 5 
fig/ml Hoechst 33342 (Molecular Probes) in C-DMEM for 20 minutes at 33*^0 and 5% 
CO2. Cells were washed once in Hank's Balanced Salt Solution (HBSS) followed by 

10 the addition of 100 nM dexamethasone in HBSS with 1% charcoal/dextran-treated 
FBS. To obtain fixed time point dexamethasone titration data, transfected HeLa cells 
were first washed with DMEM and then incubated at 33°C and 5% CO2 for 1 h in the 
presence of 0 - 1000 nM dexamethasone in DMEM containing 1% charcoal/dextran- 
treated FBS. Cells were analyzed live or they were rinsed with HBSS, fixed for 15 min 

15 with 3.7% formaldehyde in HBSS, stained with Hoechst 33342, and washed before 
analysis. The intracellular GFP-hGR fluorescence signal was not diminished by this 
fixation procedure. 

Image acquisition and analysis. Kinetic data were collected by acquiring 
fluorescence image pairs (GFP-hGR and Hoechst 33342-labeled nuclei) from fields of 

20 living cells at 1 min intervals for 30 min after the addition of dexamethasone. 
Likewise, image pairs were obtained from each well of the fixed time point screening 
plates 1 h after the addition of dexamethasone. In both cases, the image pairs obtained 
at each time point were used to defme nuclear and cytoplasmic regions in each cell. 
Translocation of GFP-hGR was calculated by dividing the integrated fluorescence 

25 intensity of GFP-hGR in the nucleus by the integrated fluorescence intensity of the 
chimera in the cytoplasm or as a nuclear-cytoplasmic difference of GFP fluorescence. 
In the fixed time point screen this translocation ratio was calculated from data obtained 
from at least 200 cells at each concentration of dexamethasone tested. Drug-induced 
translocation of GFP-hGR from the cytoplasm to the nucleus was therefore correlated 

30 with an increase in the translocation ratio. 

Results, Figure 20 schematically displays the drag-induced cytoplasm 253 to 
nucleus 252 translocation of the human glucocorticoid receptor. The upper pair of 
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schematic diagrams depicts the localization of GFP-hGR within the cell before 250 (A) 
and after 251 (B) stimulation with dexamethasone. Under these experimental 
conditions, the drug induces a large portion of the cytoplasmic GFP-hGR to translocate 
into the nucleus. This redistribution is quantified by determining the integrated 
5 intensities ratio of the cytoplasmic and nuclear fluorescence in treated 255 and 
untreated 254 cells. The lower pair of fluorescence micrographs show the dynamic 
redistribution of GFP-hGR in a single cell, before 254 and after 255 treatment. The 
HCS is performed on wells containing hundreds to thousands of transfected cells and 
the translocation is quantified for each cell in the field exhibitmg GFP fluorescence. 
10 Although the use of a stably transfected cell line would yield the most consistently 
labeled cells, the heterogeneous levels of GFP-hGR expression induced by transient 
transfection did not interfere with analysis by the cell screening system of the present 
invention. 

To execute the screen, the cell screening system scans each well of the plate, 

15 images a population of cells in each, and analyzes cells individually. Here, two 
channels of fluorescence are used to define the cytoplasmic and nuclear distribution of 
the GFP-hGR within each cell. Depicted in Figure 21 is the graphical user interface of 
the cell screening system near the end of a GFP-hGR screen. The user interface depicts 
the parallel data collection and analysis capability of the system. The windows labeled 

20 ''Nucleus" 261 and "GFP-hGR" 262 show the pair of fluorescence images being 
obtained and analyzed in a single field. The window labeled "Color Overlay" 260 is 
formed by pseudocoloring the above images and merging them so the user can 
immediately identify cellular changes. Within the "Stored Object Regions" window 
265, an image containing each analyzed cell and its neighbors is presented as it is 

25 archived. Furthermore, as the HCS data are being collected, they are analyzed, in this 
case for GFP-hGR translocation, and translated into an inmiediate "hit" response. The 
96 well plate depicted in the lower window of the screen 267 shows which wells have 
met a set of user-defined screening criteria. For example, a white-colored well 269 
indicates that the drug-induced translocation has exceeded a predetermined threshold 

30 value of 50%. On the other hand, a black-colored well 270 indicates that the drug being 
tested induced less than 10% translocation. Gray-colored wells 268 indicate "hits" 
where the translocation value fell between 10% and 50%. Row "E" on the 96 well 
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plate being analyzed 266 shows a titration with a drug known to activate GFP-hGR 
translocation, dexamethasone. This example screen used only two fluorescence 
channels. Two additional channels (Channels 3 263 and 4 264) are available for 
parallel analysis of other specific targets, cell processes, or cytotoxicity to create 
5 multiple parameter screens- 
There is a link between the image database and the information database that is 
a powerful tool during the validation process of new screens. At the completion of a 
screen, the user has total access to image and calculated data (Figure 22). The 
comprehensive data analysis package of the cell screening system allows the user to 

10 examine HCS data at multiple levels. Images 276 and detailed data in a spread sheet 
279 for individual cells can be viewed separately, or summary data can be plotted. For 
example, the calculated results of a single parameter for each cell in a 96 well plate are 
shown in the panel labeled Graph 1 275. By selecting a single point in the graph, the 
user can' display the entire data set for a particular ceil that is recalled from an existing 

15 database. Shown here are the image pair 276 and detailed fluorescence and 
morphometric data from a single cell (Cell #118, gray line 277). The large graphical 
insert 278 shows the results of dexamethasone concentration on the translocation of 
GFP-hGR. Each point is the average of data from at least 200 cells. The calculated 
EC50 for dexamethasone in this assay is 2 nM. 

20 A powerful, aspect of HCS with the cell screening system is the capability of 

kinetic measurements using multicolor fluorescence and morphometric parameters in 
living cells. Temporal and spatial measurements can be made on single cells within a 
population of cells in a field. Figure 23 shows kinetic data for the dexamethasone- 
induced translocation of GFP-hGR in several cells within a single field. Human HeLa 

25 cells transfected with GFP-hGR were treated with 100 nM dexamethasone and the 
translocation of GFP-hGR was measured over time in a population of single cells. The 
graph shows the response of transfected cells 285, 286> 287. and 288 and non- 
transfected cells 289 . These data also illustrate the ability to analyze cells with 
different expression levels. 

30 
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Example 6 High-content screen of drug-induced apoptosis 

Apoptosis is a complex cellular program that involves myriad molecular events 
and pathways. To understand the mechanisms of drug action on this process, it is 
essential to measure as many of these events w^ithin cells as possible with temporal and 

5 spatial resolution. Therefore, an apoptosis screen that requires little cell sample 
preparation yet provides an automated readout of several apoptosis-related parameters 
would be ideal. A cell-based assay designed for the cell screening system has been 
used to simultaneously quantify several of the morphological, organellar, and 
macromolecular hallmarks of paclitaxel-induced apoptosis. 

10 Cell preparation. The cells chosen for this study were mouse connective tissue 

fibroblasts (L-929; ATCC CCL-1) and a highly invasive glioblastoma cell line (SNB- 
19; ATCC CRL-2219) (Welch et al. In Vitro Cell Dev. Biol 31:610, 1995). The day 
before treatment with an apoptosis inducing drug, 3500 cells were placed into each well 
of a 96-well plate and incubated overnight at 37°C in a humidified 5% CO2 

15 atmosphere. The following day, the culture medium was removed from each well and 
replaced with fresh medium containing various concentrations of paclitaxel (0 - 50 
)iM) from a 20 mM stock made in DMSO. The maximal concentration of DMSO used 
in these experiments was 0.25%. The cells were then incubated for 26 h as above. At 
Ihe end of the paclitaxel treatment period, each well received fresh medium containing 

20 750 nM MitoTracker Red (Molecular Probes; Eugene, OR) and 3 ^g/ml Hoechst 33342 
DNA-binding dye (Molecular Probes) and was incubated as above for 20 min. Each 
well on the plate was then washed with HBSS and fixed with 3.7% formaldehyde in 
HBSS for 15 min at room temperature. The formaldehyde was washed out with HBSS 
and the cells were permeabilized for 90 s with 0.5% (v/v) Triton X-100, washed with 

25 HBSS, incubated with 2 U ml'' Bodipy FL phallacidin (Molecular Probes) for 30 min, 
and washed with HBSS. The wells on the plate were then filled with 200 fil HBSS, 
sealed, and the plate stored at A^'C if necessary. The fluorescence signals from plates 
stored this way were stable for at least two weeks after preparation. As in the nuclear 
translocation assay, fluorescence reagents can be designed to convert this assay into a 

30 live cell high-content screen. 

Image acquisition and analysis on the ArrayScan System, The fluorescence 
intensity of intracellular MitoTracker Red, Hoechst 33342, and Bodipy FL phallacidin 
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was measured with the cell screening system as described supra. Morphometric data 
from each pair of images obtained from each well was also obtained to detect each 
object in the image field (e.g.^ cells and nuclei), and to calculate its size, shape, and 
integrated intensity. 

5 Calculations and output A total of 50-250 cells were measured per image 

field. For each field of cells, the following calculations were performed: (1) The 
average nuclear area (fim^) was calculated by dividing the total nuclear area in a field 
by the number of nuclei detected. (2) The average nuclear perimeter (p.m) was 
calculated by dividing the sum of the perimeters of all nuclei in a field by the number 

10 of nuclei detected in that field. Highly convoluted apoptotic nuclei had the largest 
nuclear perimeter values. (3) The average nuclear brightness was calculated by dividing 
the integrated intensity of the entire field of nuclei by the nimiber of nuclei in that field. 
An increase in nuclear brightness was correlated with increased DNA content. (4) The 
average cellular brightness was calculated by dividing the integrated intensity of an 

15 entire field of cells stained with MitoTracker dye by the number of nuclei in that field. 
Because the amount of MitoTracker dye that accumulates within the mitochondria is 
proportional to the mitochondrial potential, an increase in the average cell brightness is 
consistent with an increase in mitochondrial potential. (5) The average cellular 
brightness was also calculated by dividing the integrated intensity of an entire field of 

20 cells stained with Bodipy FL phallacidin dye by the number of nuclei in that field. 
Because the phallotoxins bind with high affinity to the polymerized form of actin, the 
amount of Bodipy FL phallacidin dye that accumulates within the cell is proportional to 
actin polymerization state. An increase in the average cell brightness is consistent with 
an increase in actin polymerization. 

25 Results. Figure 24 (top panels) shows the changes paclitaxel induced in the 

nuclear morphology of L-929 cells. Increasing amounts of paclitaxel caused nuclei to 
enlarge and fragment 293, a hallmark of apoptosis. Quantitative analysis of these and 
other images obtained by the cell screening system is presented in the same figure. 
Each parameter measured showed that the L-929 cells 296 were less sensitive to low 

30 concentrations of paclitaxel than were SNB-19 cells 292. At higher concentrations 
though, the L-929 cells showed a response for each parameter measiured. The 
multiparameter approach of this assay is useful in dissecting the mechanisms of dmg 
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action. For example, the area, brightness, and fragmentation of the nucleus 298 and 
actin polymerization values 294 reached a maximum value when SNB-19 cells were 
treated with 10 nM paclitaxel (Figure 24; top and bottom graphs). However, 
mitochondrial potential 295 was minimal at the same concentration of paclitaxel 

5 (Figure 24; middle graph). The fact that all the parameters measured approached 
control levels at increasing paclitaxel concentrations (>10 nM) suggests that SNB-19 
cells have low affinity dmg metabolic or clearance pathways that are compensatory at 
sufficiently high levels of the drug. Contrasting the drug sensitivity of SNB-19 cells 
297. L-929 showed a different response to pacUtaxel 296 . These fibroblastic cells 

10 showed a maximal response in many parameters at 5 jxM paclitaxel, a 500-fold higher 
dose than SNB-19 cells. Furthermore, the L-929 cells did not show a sharp decrease in 
mitochondrial potential 295 at any of the paclitaxel concentrations tested. This result is 
consistent with the presence of unique apoptosis pathways between a normal and 
cancer cell line. Hierefore, these results indicate that a relatively simple fluorescence 

15 labeling protocol can be coupled with the cell screening system of the present invention 
to produce a high-content screen of key events involved in programmed cell death. 

Background 

A key'^to the mechanism of apoptosis .was the discovery that, irrespective of the 
lethal stimulus, death results in identical apoptotic morphology that includes cell and 
organelle dismanthng and repackaging, DNA cleavage to nucleosome sized firagments, 
and engulfinent of the fi-agmented cell to avoid an inflanmiatory response, Apoptosis is 
therefore distinct fi-om necrosis, which is mediated more by acute trauma to a cell, 
resulting in spillage of potentially toxic and antigenic cellular components into the 
intercellular milieu, leading to an inflammatory response. 

The criteria for determining whether a cell is undergoing apoptosis (Wyllie et 
al. 1980. Int Rev CytoL 68:251-306; Thompson, 1995. Science. 267:1456-62; Majno 
and Joris. 1995. Am J Pathol 146:3-15; Allen et al, 1998. Cell Mol Life ScL 54:427-45) 
include distinct morphological changes in the appearance of the cell, as well as 
alterations in biochemical and molecular markers. For example, apoptotic cells often 
undergo cytoplasmic membrane blebbing, their chromosomes rapidly condense and 
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aggregate aroimd the nucleus ffagmerits, aiid small apbplotic 

bodies are fomiedL: In mmy, but not all, apdptotic cells, chromatin becbiries a target for 
specific nucleases that cIm * 

Apoptdsis "is commonly "acbompani^^^ by a characteristic change iix nuclear 
5 morphblogy (chfoihatih conderisatiori or fragmentation) and a siep^wise fragmentation 
of DNA culminating in the fomSatioii of mono- arid/or oligomeric fragrhehb of ^OO 
base pairs. Specific changes in orgianelljar ftmctiori, such as mitochohdrial membrane 
potential, occur. In addition, specific cysteine proteases (caspases) are activated, which 
catalyzes a highly selective patterh bfpibtbin degradation by proteolytic cleavage after 

10 specific aspartic acid residties. In additioi^ the externial surface exposure of 
phosphatidylserine residues (norihally on the innfer membrane leaflet) Allows ^^fo^^ 
recognition and eliiiunation of apoptotic ce^^ before the membrane breaks lip and 
cytosol or organelles spill into the intercelliilai: space and elicit inflammatory reactions. 
Moreover, cells imdergbirig apdptosis tend to shrink, while also having a reduced 

15 intracellular potassium level/ ^ 

The general patterns of apbptbtic signals are very similiar among different cell types 
and apoptotic inducers. However, the details of the pathways actually vary significantly 
depending on cell type and inducer. The dependence and independmce of various signal 
transduction'^pathways iriyplved in apoptosis are currently topics of intense reseiarch. We 

20 show here that the pathway also varies depending upon the dose of the inducer, in specific 
cell tjT^es. 

Nuclear Morphology 

Cells undergoing apoptosi^ generally exhibit two types pf nuclear change, 
25 fragmentation or condensation ((Majnp and Joris, 1995), (Eamshaw, 1995)). The 

response in a given cell type appears to vary depending on the apoptotic inducCT. 

During nuclear fragmentation, a circular or oval nucleus becomes increasingly Ipbular. 

Eventually, the nucleus frag^nents dramatically into inultiple sub-nuclei. Sometimes the 

density of the chromatin within the lobular nucleus may show spatial variations in 
30 distrihutipn (heterochrpinatization), approximating the margination seen in nuclear 

cpndensation. 
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; - ,>^ Nupk^ been reported in spmjB cell typps, such as MGF-7 

(Saunders et al, 1997. Int J Cancer, 70:214-20). Condensation appears to arise as a 
cpiisequence of the loss of stnxctural integrity, of ^ A nuclear matrix and 

nuclear lamma (Hendzel et al.^ 199^, J .Biol Chem. 273:24470-8). During nuclear 
5 condensation, the chromatin concentrates near the margin of Jhe nucleus, leading to the 
pyeralLshrinkage of .the nucleus. Thus,, the use of nuclear morpholp^^ a measure of 
apoptosis niiist take both conder^ 

Material and Methods 

10 Cells were plated into 96-well plates at densities of 3 x lO^ to 1 x 10"^ cQlls/well. 

The following dajf apoptptic inducers w^ cells 
were jncuba^d for indica^^^^^ 16r3Q houirs). The next day medium 

was removed and cells were stained with 5 |ig/ml Hpechst (Molecular 
fresh medium ^nd incubated for 30 minutes at 3 7° C. Cells were ^^^^ashed in Hank's 

15 Balanced Salt Solution (HBSS)i and -fixed with 3.7% formaldehyde in HBSS at room 
temperature. Cells were washed; 2X with HBSS at room temperature andvthe plate was 
sealed. . - - ^ - ~ , " . 

Quantitation of changes in nuclear morphology upon induction of apoptosis was 
accomplished by (l)''measuring the effective size of the nuclear region; and (2) 

20 measuring the degree of convolution -of the perimeter. The size parameter pro vides the 
more sensitive measure of nuclear condensation^ whereas , the perimeter measure 
provides a- more sensitive measure of nuclear fragmentation. 

Results & Discussipii 

25 L929 cells responded to both staurpspprine (30 hours) and paclitaxel (30 hotu-s) 

with a dose-dependent change in nuclear morphology (Kg 25A and 25B). TSHK. cdls 
illustrated a slightly more, cpmplicated, yet clearly visible response. Staurpsporine 
appeared to stimulate nuclear condensation at lower doses apd, nuclear fragmentation. at 
higher doses (Fig 25C and 25D). In contrast, paclitaxel induced a consistent increase in 

30 nuclear fragni with increasing concentrati^ ^pie re?5>pnse. o^ cells 

varied dramaticaUy depending upon the appptotic inducer. Staurpspprine appeared to 
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elicit nuclear condensation whereas paclitaxel induced nuclear fragmentation (Fig 25E 
and 25F). 

Figure 26 illustrates the dose response of cells in terms of both nuclear size and 
nuclear perimeter convolution. There appears to be a swelling of the nuclei that 
5 precedes the fragmentation. 

Result of evaluation: Differential responses by cell lines and by apoptotic 
inducers were observed in a dose dependent maimer, indicating that this assay will be 
useful for detecting changes in the nucleus characteristic of apoptosis. 

10 Actin reorganization 

We assessed changes in the actin cytoskeleton as a potential parameter related 
to apoptotic changes. This was based on preliminary observations of an early increase 
in f-actin content detected with fluorescent phalloidin labeling, an f-actin specific stain 
(our unpubhshed data; Levee et al. 1996. Am J Physiol. 271:C1981-92; Maekawa et al. 

15 1996. Clin Exp Immunol. 105:389-96). Changes in the actin cytoskeleton during 
apoptosis have not been observed in all cell types. (Endresen et al. 1995. Cytometry. 
20:162-71, van Engeland et al. 1997. Exp Cell Res, 235:421-30). 
Material and Methods 

Cells were plated in 96-well plates at densities of 3 x 10^ to 1 x 10"^ cells/well. 

20 The following day apoptotic inducers were added at indicated concentrations. Cells 
were incubated for the indicated time periods (usually 16-30 hours). The next day the 
medium was removed and cells were stained with 5 p-g/ml Hoechst (Molecular Probes, 
Inc.) in fresh mediiun and incubated for 30 minutes at 30°C. Cells were washed in 
HBSS and fixed with 3.7% formaldehyde in HBSS at room temperature. Plates were 

25 washed with HBSS and permeabilized with 0.5% v/v Triton X- 100 in HBSS at room 
temperature. Plates were washed in HBSS and stained with 100 fil of lU/ml of Alexa 
488 Phalloidin stock (100 fil/well. Molecular Probes, Inc.). Cells were washed 2X with 
HBSS at RT and the plate was sealed. 

Quantitation of f-actin content was accomplished by measuring the intensity of 

30 phalloidin staining around the nucleus. This was determined to be a reasonable 

approximation of a fiill cytoplasmic average of the intensity. The mask used to 

approximate this cytoplasmic measure was derived from the nuclear mask defined by 
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the Hoechst stain. Derivation was accomplished by combinations of erosions and 
dilations. 

Results and Discussion 
5 Changes in f-actin content varied based on cell type and apoptotic inducer (Fig 

27). Staurosporine (30 hours) induced increases in f-actin in L929 (Fig. 27A) and BHK 
(Fig. 27B) cells. MCF-7 cells exhibited a concentration-dependent response. At low 
concentrations (Fig. 27E) there appeared to be a decrease in f-actin content. At higher 
concentrations, f-^actin content increased. Paclitaxel (30 hours) treatment ltd to a wide 
10 variety of responses. L929 cells responded with graded increases in f-actin (Fig. 27B) 
whereas both BHK and MCF-7 responses were highly variable (Figs. 27D & 27F, 
respectively). 

Result of Evaluation: Both increases and decreases in signal intensity were 
15 measured for several cell lines and foimd to exhibit a concentration dependent 
response. For certain cell line/apoptotic inducer pairs this could be a statistically 
significant apoptotic indicator. 

Changes in Mitochondrial Mass/Potential 
20 Introduction 

Changes in mitochondria play a central role in apoptosis (Henkart and 
Grinstein. 1996. J Exp Med. 183:1293-5). Mitochondria release ^optogenic factors 
through the outer membrane and dissipate the electrochemical gradient of the inner 
membrane. This is thought to occur via formation rof the mitochondria permeability 

25 transition (MPT), although it is apparently not tme in all cases. An obvious 
manifestation of the formation of the MPT is collapse of the mitochondrial membrane 
potential. Inhibition of MPT by pharmacological intervention or mitochondrial 
expression of the anti-apoptotic protesin Bcl-2 prevents cell deaths suggesting the 
formation of the MPT may be a rate-limiting event of! the death process (For review 

30 see: Kroemer et 4.4998. AnnwRev. B^^^^ 60:619r42). It has also been observedjthat 
mitochondria can proliferate during stimulation of apoptosis (Mancini et al. 1997. J 
Cell Biol. 1 38 :449.69; Camilleri-Broet et al. 1 998. Exp Cell Res. 239:277-92). = 
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One approach for measuring apoptosis-induced changes in mitochondria is to 
measure the mitochondrial membrane potential. Of the methods available, the simplest 
measure is the redistribution of a cationic dye that distributes within intracellular 
organelles based on the membrane potential. Such an approach traditionally requires 
5 live cells for the measurements. The recent introduction of the MitoTracker dyes (Foot 
et al. 1997. Cytometry. 27:358-64; available from Molecular Probes, Inc., Oregon) 
provides a means of measuring mitochondrial membrane potential after fixation. 

Given the observations of a possible increase in mitochondrial mass during 
apoptosis, the amoimt of dye labeling the mitochondria is related to both membrane 
10 potential and the number of mitochondria. If the number of mitochondria remains 
constant then the amount of dye is directly related to the membrane potential. If the 
number of mitochondria is not constant, then the signal will likely be dominated by the 
increase in mass (Reipert et al. 1995. Exp Cell Res. 221:281-8). 

Probes are available that allow a clear separation between changes in mass and 
15 potential in HCS assays. Mitochondrial mass is measured directly by labeling with 
Mitotracker Green FM (Poot and Pierce, 1999, Cytometry, 35:311-7; available from 
Molecular Probes, Inc., Oregon). The labeling is independent of mitochondrial 
membrane potential but proportional to mitochondrial mass. This also provides a 
means of normalizing other mitochondrial measures in each cell with respect to 
20 mitochondrial mass. 

Material and Methods 

Cells were plated into 96-well plates at densities of 3 x 10^ to 1 x lO'^ cells/well. 
The following day apoptotic inducers were added at the indicated concentrations and 

25 cells were incubated for the indicated time periods (usually 16-30 hours). Cells were 
stained with 5 jig/ml Hoechst (Molecular Probes, Inc.) and 750 nM MitoTracker Red 
(CMXRos, Molecular Probes, Inc.) in fresh medium and incubated for 30 minutes at 
37^C. Cells were washed in HBSS and fixed with 3.7% formaldehyde in HBSS at room 
temperature. Plates were washed with HBSS and permeabilized with 0.5% v/v Triton 

30 X-100 in HBSS at room temperature. Cells were washed 2X with HBSS at room 
temperature and the plate was sealed. For dual labeling of mitochondria, cells were 
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treated with 200 nM Mitotracker Green and 200 nM Mitotracker <Red for 0.5 hours 
before fixation. 

Results & Discussion 

5 induction of appptosis by, staurosporine and paclitaxel led to varying 

niitochondrial changes depending upon the stimulus. L929 cells exhibited a clear 
increase in mitochondrial mass with increasing staurospprine cpnpentrations (Fig. 28). 
BHK cells exhibited either a decrease in membrane potential at lower concentrations of 
staiu-psporine, or an increase in mass at higher concentrations of staurosporihe ;(Fig. 

10 28G). MCF-? cells responded by a consij^tent decrease in mitochondrial membrane 
potential in response , to increasing epnqentratioiuit of staurosporine^^^(^^ 28E). 
Increasing concentrations of^rpaclitaxel caused consistent increases in mitochondrial 
mass (Fig 28B, 28D, and 28F). ^ K 

The mitochondrial membrane potential is measured by labeling mitochondria 

15 with bpth- Mitotracker Green FM and Mitpte^^ (Molecular Probes, Inc). 

Mitotracker Red labeling is proportional to both mass and membrane potential. 
Mitotracker Green FM labeling^s proportional to niass. The ratio, of Mitotracker Red 
signal to . the Mitotracker Green; ^FM signal prpyides a nieasure of mitochondrial 
membrane potential (Ppot and Pierce, ;1999t): This ratio normalizes the mitochondriail 

20 mass with respect, to the Mitotracker Red signal. (See Figure 28G) Combining the 
ability to nomialize to mitochondrial mass with a measure of the membrane potential 
allows independent assessment of both parameters. 

Result of Evaluation: Both decreases in pptential and increases in mass were observed 
25 depending on the cell line and inducer tested. Dose dependent correlation demonstrates 

that this is a promising apoptotic indicator. . 

It is , possible to combine niultiple measures of apoptpsis by exploiting the 

spectral domain of fluorescence spectroscopy. In fact, all of the nuclejar morphplogy/f^ 

actin content/mitochondrial mass/niitqchondrial potential data shown earlier were 
30 collected as multip^amete^^ assays, but were presented indiyi^^ 
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Example Z Protease induced translocation of a signaling enzyme containing a 
disease-associated sequence from cytoplasm to nucleus. 

Plasmid construct, A eukaryotic expression plasmid containing a coding 
5 sequence for a green fluorescent protein - caspase (Cohen (1997), Biochemical J. 
326:1-16; Liang et al. (1997), J. ofMolec. Biol 274:291-302) chimera is prepared using 
GFP mutants. The construct is used to transfect eukaryotic cells. 

Cell preparation and transfection. Cells are trypsinized and plated 24 h prior 
to transfection and incubated at 37'*C and 5% CO2. Transfections are performed by 
10 methods including, but not limited to calcium phosphate coprecipitation or lipofection. 
Cells are incubated with the calcium phosphate-DNA precipitate for 4-5 hours at 37°C 
and 5% CO2, washed 3-4 times with DMEM to remove the precipitate, followed by the 
addition of C-DMEM. Lipofectamine transfections are performed in serum-free 
DMEM without antibiotics according to the manufacturer's instructions. Following a 
15 2-3 hour incubation with the DNA-liposome complexes, the medium is removed and 
replaced with C-DMEM. 

Apopototic induction of Caspase-GFP translocation. To obtain Caspase-GFP 
translocation kinetic data, nuclei of transfected cells are first labeled with 5 jag/ml 
Hoechst 33342 (Molecular Probes) in C-DMEM for 26 minutes at 37°C and 5% CO2. 
20 Cells are washed once in Hank's Balanced Salt Solution (HBSS) followed by the 
addition of compounds that induce apoptosis. These compounds include, but are not 
limited to paclitaxel, staurosporine, ceramide, and tumor necrosis factor. To obtain 
fixed time point titration data, transfected cells are first washed with DMEM and then 
incubated at 37°C and 5% CO2 for 1 h in the presence of 0 - 1000 nM compound in 
25 DMEM. Cells are analyzed live or they are rinsed with HBSS, fixed for 15 min with 
3.7% formaldehyde in HBSS, stained with Hoechst 33342, and washed before analysis. 

Image acquisition and analysis. Kinetic data are collected by acquiring 
fluorescence image pairs (Caspase-GFP and Hoechst 33342-labeled nuclei) from fields 
of living cells at 1 min intervals for 30 min after the addition of compound. Likewise, 
30 image pairs are obtained from each well of the fixed time point screening plates 1 h 
after the addition of compound. In both cases, the image pairs obtained at each time 
point are used to define nuclear and cytoplasmic regions in each cell. Translocation of 
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Caspase-GFP is calculated by dividing the integrated fluorescm^ intensity, of Caspase- 
GFP in the nucleus by the integrated fluorescence intensity of the chimera in the 
cytbplasiri or as^a 'nuclear-cj^opl^ difference of GFP^fluorescencer In the fixed 
time point screen this trmsipca^^ is calcidated from date obtome^^ least 

200 cells at each coricMiration W cbinpburid tested. Dhig-ihduced transloGatioii of 
Caspase-GFP from the cytoplasm to the nucleus is therefore coirelated with an increase 
in the translocation ratio. Molecular interaction libraries including, but not Umited to 
those coniprising put^^ sictivMtbrs or iniiibitbi^ of ap^bptbsis enzymes are 

use to screen the indicator cell lines and identify a specific ligand for the DAS, and a 
pa&way activated by cbmp^^^ 

Example 8, Identification of nqyel steroid receptors from DAS 

Two sources of material and/or information are required to make use of this 
embodiment, wliich allows assessment of the fimction of an uncharacterized gene. 
15 First, disease associated sequence baiik(sj containing cDNA sequences suitable for 
transfection into mammaLlian cells can be used. BecsLuse every RADE or differential 
expression experiment generates up to several hundred sequences, it is possible to 
geiierate an ample supply of DAS. Second, information from primaiy sequence 
database searches can be used to place DAS into brbad^categories, including, but not 
20 limited to, those tha:f' contain signal sequences, seven trans-membrane motifs, 
cbriserved protease active site domains; or other identifiable motifs. Based bh the 
information acqiiired 'from these sources, ihethod types and indicator cell lines to be 
trahsfected are selected. A large number of motifs are already well characterized iand 
encoded in the linear seqiiences contained within the large number genes in existing 
25 genomic databases. 

In one embodiment, the following steps aire taken: 

1) Information fi-om the DAS identification experiment (including database 
searches) is used as the basis for selecting the relevant biological processes, (for 
exairiple, look DAS frorn a tiimor line for cell cycle modulation, apoptosis, 

30 rnetastaticipro^ . „ 

2) Sorting of DNA sequences or DAS by identifiable motifis (ie. signal 
sequences, 7-. transmembrane domains, conserved protease active site doinains, etc.) 
This initial grouping will determine fluorescent tagging strategies, host cell lines, 
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indicator cell lines, and banks of bioactive molecules to be screened, as described 
supra, 

3) Using well established molecular biology methods, ligate DAS into an 
expression vector designed for this purpose. Generalized expression vectors contain 
promoters, enhancers, and terminators for which to deliver target sequences to the cell 
for transient expression. Such vectors may also contain antibody tagging sequences, 
direct association sequences, chromophore fusion sequences like GFP, etc. to facilitate 
detection when expressed by the host. 

4) Transiently transfect cells with DAS containing vectors using standard 
transfection protocols including: calcium phosphate co-precipitation, liposome 
mediated, DEAE dextran mediated, polycationic mediated, viral mediated, or 
electroporation, and plate into microtiter plates or microwell arrays. Alternatively, 
transfection can be done directly in the microtiter plate itself 

5) Carry out the cell screening methods as described supra. 

In this embodiment, DAS shown to possess a motif(s) suggestive of 
transcriptional activation potential (for example, DNA binding domain, amino terminal 
modulating domain, hinge region, or carboxy terminal ligand binding domain) are 
utilized to identify novel steroid receptors. 

Defining the fluorescent tags for this experiment involves identification of the 
nucleus through staining, and tagging the DAS by creating a GFP chimera via insertion 
of DAS into an expression vector, proximally fused to the gene encoding GFP. 
Alternatively, a single chain antibody fragment with high affinity to some portion of the 
expressed DAS could be constmcted using technology available in the art (Cambridge 
Antibody Technologies) and linked to a fluorophore (FITC) to tag the putative 
transcriptional activator/receptor in the cells. This alternative would provide an 
external tag requiring no DNA transfection and therefore would be useful if distribution 
data were to be gathered from the original primary cultures used to generate the DAS. 

Plasmid construct, A eukaryotic expression plasmid containing a coding 
sequence for a green fluorescent protein - DAS chimera is prepared using GFP 
mutants. The construct is used to transfect HeLa cells. The plasmid, when transfected 
into the host cell, produces a GFP fiised to the DAS protein product, designated GFP- 
DASpp. 
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Cell preparation and transfectioju HeLa cells are trypsinized and plated using 
DMEM containing 5% charcoal/dextran-treated fetal bovine seram (FBS) (Hyclone) 
and 1% penicillin-streptomycin (C-DMEM) 12-24 hours prior to transfection and 
incubated at 37°C and 5% CO2 . Transfections are performed by calcium phosphate 

5 coprecipitation or with Lipofectamine (Life Technologies). For the calcium phosphate 
transfections, the medium is replaced, prior to transfection, with DMEM containing 5% 
charcoal/dextran-treated FBS. Cells are incubated with the calcium phosphate-DNA 
precipitate for 4-5 hours at 37°C and 5% CO2, and washed 3-4 times with DMEM to 
remove the precipitate, followed by the addition of C-DMEM. Lipofectamine 

10 transfections are performed in serum-free DMEM without antibiotics according to the 
manufacturer's instructions. Following a 2-3 hour incubation with the DNA-liposome 
complexes, the medium is removed and replaced with C-DMEM. All transfected cells 
in 96-well microliter plates are incubated at 33°C and 5% CO2 for 24-48 hours prior to 
drug treatment. Experiments are performed with the receptor expressed transiently in 

15 HeLa cells. 

Localization of expressed GFP-DASpp inside cells. To obtain cellular 
distribution data, nuclei of transfected cells are first labeled with 5 fig/ml Hoechst 
33342 (Molecular Probes) in C-DMEM for 20 minutes at 33*'C and 5% CO2. Cells are 
washed once in Hank's Balanced Salt Solution (HBSS). The cells 1are analyzed live or 
20 they are rinsed with HBSS, fixed for 15 min with 3.7% formaldehyde in HBSS, stained 
with Hoechst 33342, and washed before analysis. 

In a preferred embodiment, image acquisition and analysis are performed using 
the cell screening system of the present invention. The mtracellular GFP-DASpp 
fluorescence signal is collected by acquiring fluorescence image pairs (GFP-DASpp 
25 and Hoechst 33342-labeled nuclei) fi-om field cells. The image pairs obtained at each 
time point are used to define nuclear and cytoplasmic regions in each cell. Data 
demonstrating dispersed signal in the cytoplasm would be consistent with known 
steroid receptors that are DNA transcriptional activators. 

Screening for induction of GFP-DASpp translocation. Using the above 
30 construct, confirmed for appropriate expression of the GFP-DASpp, as an indicator cell 
line, a screen of various ligands is performed using a series of steroid type ligands 
including, but not limited to: estrogen, progesterone, retinoids, growth factors, 
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androgens, and many other steroid and steroid based molecules. Image acquisition and 
analysis are performed using the cell screening system of the inveritibh. The 
intracellular GFP-DASpp , fluorescence signal is collected by acquiring fluorescence 
image pairs. (GFP-DASpp and Hoechst 33342-labeled nuclei) from fields cdlls. The 
image pairs obtained , at each time point are used to define nuclear and cytoplasmic 
regions Jn each . cell. Translocation of GFP-DASpp is calculated by dividing •'the 
integrated fhxorescence iiit«isity of G^-DAS^ in the^ nucleus by the iirtb^at^ 
fluorescence intensity of the clum?rk in the cytoplasm or as a nucld^^ 
difference of GFP fluorescence. A transloc^on from the cytoplasm into the nucleiis 
indicates a ligand biiiding .activation of the DASpp thus identifying the potykial 
receptor class and action. Combinmg this data with other date obtained in amilk 
fashion using known inhibitors and modifiers of steroid receptors, would either validate 
the DASpp as a tso-get, or more data would be generated from various sources. 



15 Example 9 Additional Screens 

Translocation between the plasina membrane a^^ 

Profilactin, complex dissociation and binding of profflm to the plasma 
membrane. In one embodiment, a fluorescent protein biosensor of profiUn membrafae 
binding is prepared by labeling purified profilin (Federov et 31.(1^9^); J. Molec. Biol. 

20 241:480-^482; Lanbrechts et al. (\995), Eur. J. Biochem. 230:^8il-286) with a probe 
possessing a fluorescence lifetime in the range of 2-300 ns. The labeled profiliri is 
introduced into living indicator cells using bulk loading methodology arid tlie indicator 
cells are treated with test compounds. Fluorescence anisotropy imaging iiiicroscopy 
(Gough-and Taylor (1993), J. Cell Biol. 121:1095-1107) is used to measure test- 

25 compound dependent movement of the fluorescent derivative of profilin betWeen the 
cytoplasm and membrane for a period of time after treatment rariging from 0.1 s to 10 
h. 

Rho-RhoGDI complex translocatipn to the membrane, In another 

embodiment, indicator cells are treated with test compounds and then fixed, washed, 

30 and permeabilizedv' The indicator cell pl^^nia^mei^brane, . cytopl^OTj, and nucleus are 

all labeled with distinctly colored markers followed by iirununolocaliza^on of Rho 

protein (Self et al. {\99S)i Methods in Enzymology 256:3-10; Tanaka et al. (1995), 
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Methods in Enzymology 256:41-49) with antibodies labeled with a fourtii colon Each 
of tiie four labels is imaged separately using the cell screening system, and the images 
used to calculate the amount of inhibition or activation of translocation effected by the 
test compound. To do this calculation, iflie images of the probes used to mark the 
5 plasma membrane and cytoplasm are used to mask the image of the immunp logical 
probe marking the jocation of intracellular Rho jprptein. The integrated brightness p^er 
unit area under each niask is use4 to form a translocation quotient by .dividing the 
plasma membrane integrated brightness/area by, the cytoplasmic integrated 
brightness/area. By comparing the translocation quqtierit values frpin control m 
10 experimental wells, the percent calculated for each potential lea<i 
compound. 

^'Arrestin translocation to the plasma membrane upon G-protein receptor activation. 

In another embodiment of a cytoplasm to membrane translocation high-content 

15 screen, the tr^location of p-arrestm ptotein from the cytoplasm to the plasma 
meiribi-ane is itieasured in response to cell^ fr^ To measure the tr^lbc^^ 

living indicator cells containihg luniinescent domi^^^ markers iure' tfSatcK^ With test 
cbmpouiids an<^ the movement of the P-£trrestiri m:ar^keris measured iii time md space 
using the cell screening system of the present 'invention. In d preferred emboidiment, 

20 the indicator cells contain liimineScent markers consistirig^of a green fluorescent protein 
p-arrestih (GIT-p-arr)Dstiri) pr^ chimera (Barak et all (1991), J. BidL Chem, 
272:27497-27500; Dkaka et ^. (1998), J, Biol CAe^£ 273:685-688) that is expressed 
by the liidibitof cells throiigh the use of transieiit or stable cell transfectibii and dthCT 
reporters used to mark cytoplasmic and mernbrahe domains. When the indicator ceUs 

25 are in the resting state, the domatin marker molecules partition prddoniinately in the 
plasma membfaiie of iii the cytbpilasm. In the high-content scfeiein, these markers are 
used to delineate the cell cytojpiasm aiiid plasrna iridnibfiaiie iii distinct chMiels of 
fluorescence. When the indicator cells are treated with a test corhpound, the dynamic 
redistribution of the GFP-P-arfestin is recorded as a series of inaiages over a tiniis scale 

30 ranging from 0.i s to 10 h. In a preferred embodiment, the tinie scale is 1 h. Each 
image is analyzed By a method that quantifies the movenient of the GFP-P-arrestin 
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protein chimera between the plasma membrane and the cytoplasm. To do this 
calculation, the images of the probes used to mark the plasma membrane and cytoplasm 
are used to mask the image of the GFP-p-arrestin probe marking the location of 
intracellular GFP-p-arrestin protein. The integrated brightness per unit area under each 
5 mask is used to form a translocation quotient by dividing the plasma membrane 
integrated brightness/area by the cytoplasmic integrated brightness/area. By comparing 
the translocation quotient values from control and experimental wells, the percent 
translocation is calculated for each potential lead compound. The output of the higji- 
content screen relates quantitative data describing the magnitude of the translocation 
10 within a large number of individual cells that have been treated with test compounds of 
interest. 

Translocation between the endoplasmic reticulum and the Golgi: 

In one embodiment of an endoplasmic reticulum to Golgi translocation high- 
content screen, the translocation of a VSVG protein from the ts045 mutant strain of 

15 vesicular stomatitis virus (Ellenberg et al. (1997), J. Cell Biol 138:1193-1206; Presley 
et al. (1997) Nature 389:81-85) from the endoplasmic reticulum to the Golgi domain is 
measured in response to cell treatment. To measure the translocation, indicator cells 
containing luminescent reporters are treated with test compounds and the movement of 
the reporters is measured in space and time using the cell screening system of the 

20 present invention. The indicator cells contain luminescent reporters consisting of a 
GFP-VSVG protein chimera that is expressed by the indicator cell through the use of 
transient or stable cell transfection and other domain markers used to measure the 
localization of the endoplasmic reticulum and Golgi domains. When the indicator cells 
are in their resting state at 40°C, the GFP-VSVG protein chimera molecules are 

25 partitioned predominately in the endoplasmic reticulum. In this high-content screen, 
domain markers of distinct colors used to delineate the endoplasmic reticulum and the 
Golgi domains in distinct chaimeis of fluorescence. When the indicator cells are treated 
with a test compound and the temperature is simultaneously lowered to 32°C, the 
dynamic redistribution of the GFP-VSVG protein chimera is recorded as a series of 
.30 images over a time scale ranging from 0.1 s to 10 h. Each image is analyzed by a 
method that quantifies the movement of the GFP-VSVG protein chimera between the 
endoplasmic reticulum and the Golgi domains. To do this calculation, the images of 
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the probes used to mark the endoplasmic reticulum and the Golgi domains are used to 
mask the image of the GFP-VSVG probe marking the location of intracellular GFP- 
VSVG protein. The integrated brightness per unit area under each mask is used to form 
a translocation quotient by dividing the endoplasmic reticulum integrated 
5 brightness/area by the Golgi integrated brightness/area. By comparing the translocation 
quotient values from control and experimental wells, the percent translocation is 
calculated for each potential lead compound. The output of the high-content screen 
relates quantitative data describing the magnitude of the translocation within a large 
nximber of individual cells that have been treated with test compounds of interest at 
10 final concentrations ranging from 10"^^ M to 10"^ M for a period ranging from 1 min to 
10 h. 

Induction and inhibition of organellar function: 
Intracellular microtubule stability. 

15 In another aspect of the invention, an automated method for identifying 

compounds that modify microtubule structure is provided. In this embodiment, 
indicator cells are treated with test compounds and the distribution of luminescent 
microtubule-labelirig molecules is measured in space and time using a cell screening 
system, such as the one disclosed above. The luminescent microtubule-labeling 

20 molecules may be expressed by or added to the cells either before, together with, or 
after contacting the cells with a test compound. 

In one embodiment of this aspect of the invention, living cells express a 
luminescently labeled protein biosensor of microtubule dynamics, comprising a protein 
that labels microtubules fused to a luminescent protein. Appropriate microtubule- 

25 labeling proteins for this aspect of the invention include, but are not limited to a and p 
tubulin isoforms, and MAP4. Preferred embodiments of the luminescent protein 
include, but are not limited to green fluorescent protein (GFP) and GFP mutants. In a 
preferred embodiment, the method involves transfecting cells with a microtubule 
labeling luminescent protein, wherein the microtubule labeling protein can be, but is 

30 not limited to, a-tubulin, P-tubulin, or microtubule-associated protein 4 (MAP4). The 
approach outlined here enables those skilled in the art to make live cell measurements 
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to determine the effect of lead compounds on tubulin activity and microtubule stability 
in vivo. 

In a most preferred embodiment, MAP4 is fused to a modified version of the 
Aequorea victoria green fluorescent protein (GFP). A DNA construct has been made 
5 which consists of a fusion between the EGFP coding sequence (available from 
Clontech) and the coding sequence for moxise MAP4. (Olson et aL, (1995), J. Cell 
Biol. 130(3): 639-650). MAP4 is a ubiquitous microtubule-associated protein that is 
known to interact with microtubules in interphase as well as mitotic cells (Olmsted and 
Murofiishi, (1993), MAP4. In "Guidebook to the Cytoskeleton and Motor Proteins." 

10 Oxford University Press. T. Kreis and R. Vale, eds.) Its localization, then, can serve as 
an indicator of the localization, organization, and integrity of microtubules in living (or 
fixed) cells at all stages of the cell cycle for cell-based HCS assays. While MAP2 and 
tau (microtubule associated proteins expressed specifically in neuronal cells) have bem 
used to form GFP chimeras (Kaech et aL, (1996) Neuron. 17: 1189-1199; Hall et aL, 

15 (1997), Proc. Nat. Acad. Sci. 94: 4733-4738) their restricted cell type distribution and 
the tendency of these proteins to bundle microtubules when overexpressed make these 
proteins less desirable as molecular reagents for analysis in live cells originating from 
varied tissues and organs. Moderate overexpression of GFP-MAP4 does not dismpt 
microtubule function or integrity (Olson et aL, 1995), Similar constructs can be made 

20 using P-tubulin or a-tubulin via standard techniques in the art. These chimeras will 
provide a means to observe and analyze microtubule activity in living cells during all 
stages of the cell cycle. 

In another embodiment, the luminescently labeled protein biosensor of . 
microtubule dynamics is expressed, isolated, and added to the cells to be analyzed via 

25 bulk loading techniques, such as microinjection, scrape loading, and impact-mediated 
loading. In this embodiment, there is not an issue of overexpression within the cell, 
and thus a and p tubulin isoforms, MAP4, MAP2 and/or tau can all be used. 

In a further embodiment, the protein biosensor is expressed by the cell, and the 
cell is subsequently contacted with a luminescent label, such as a labeled antibody, that 

30 detects the protein biosensor, endogenous levels of a protein antigen, or both. In this 
embodiment, a luminescent label that detects a and P tubulin isoforms, MAP4, MAP2 
and/or tau, can be used. 
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A variety of GFP mutants are available, all of which would be effective in this 
invention, including, but not limited to, GFP mutants which are commercially available 
(Clontech, California). 

The MAP4 construct has been introduced into several mammalian cell lines 
5 (BHK-21, Swiss 3T3, HeLa, HEK 293, LLCPK) and the organization and localization 
of tubulin has been visualized in live cells by virtue of the GFP fluorescence as an 
indicator of MAP4 localization. The construct can be expressed transiently or stable 
cell lines can be prepared by standard methods. Stable HeLa cell lines expressing the 
EGFP-MAP4 chimera have been obtained, indicating that expression of the chimera is 

10 not toxic and does not interfere with mitosis. 

Possible selectable markers for establishment and maintenance of stable cell 
lines include, but are not limited to the neomycin resistance gene, hygromycin 
resistance gene, zeocin resistance gene, puromycin resistance gene, bleomycin 
resistance gene, and blastacidin resistance gene. 

15 The utility of this method for the monitoring of microtubule assembly, 

disassembly, and rearrangement has been demonstrated by treatment of transiently and 
stably transfected cells with microtubule dmgs such as paclitaxel, nocodazole, 
vincristine, or vinblastine. 

The present method provides high-content and combined high throughput-high 

20 content cell-based screens for anti-microtubule drugs, particularly as one parameter in a 
multi-parametric cancer target screen. The EGFP-MAP4 construct used herein can also 
be used as one of the components of a high-content screen that measures multiple 
signaling pathways or physiological events. In a preferred embodiment, a combined 
high throughput and high content screen is employed, wherein multiple cells in each of 

25 the locations containing cells are analyzed in a high throughput mode, and only a subset 
of the locations containing cells are analyzed in a high content mode. The high 
throughput screen can be any screen that would be useful to identify those locations 
containing cells that should be further analyzed, including, but not limited to, 
identifying locations with increased luminescence intensity, those exhibiting 

30 expression of a reporter gene, those undergoing calcium changes, and those 
undergoing pH changes. 
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> ; In addition to drug screening applications, the present invention may be applied 
vto clinicali diagnostics, the detection lof chemical and biological warfare weapons, and 
the basic research market since fundamental cell processes, such as cell divisidii and 
motility, are highly dependent upon microtubule dynamics. ^ 

Image Acquisition and Anaty^ 

rimage data.can be obtained from either fixed or living indicator cells. To 
extract morphometric data from each of the images obtained the following method of 
analysis is used: ^ ^ 

10 1 . Threshold each nucleus and cytoplasmic image' to produce a mask that has value = 
: 0 for e£tch pixel outside a nucleus or cell boundary. 

2. Overlay the mask on the original image^ detect each object in the field (i.e., nucleus 
or cell), and calculate its size, shape, and integrated intensity, 

3. Overlay the whole cell mask obtained above on the corresponding luminescent 
15 microtubule image and apply one or more of the following set of classifiers to 

deteraiine the micrto tubule morphology and the effect of drugs on microtubule 
morphology. ' [y .^' -^ ■' n-,-,- 

Microtubule morphology is defined using a set of classifiers to quantify aspects 
of ^ microtubule shape, size, aggregation state, - and polymerization state. These 
20 classifiers can be based on approaches that include co-occurrence matrices, texture 
measurements, spectral methods^ stnictural methods, wavelet transforms, statistical 
methods, or combinations thereof^ Examples of such classifiers are as follows: 

1. A classifier to quantify microtubule length and width using edge 
detection niethods such as that discussed in Kolega et al. ((1993), Biolmaging 1:136- 

25 1 50), wkicli discloses a non-automaLted method to determine edge stf ength in individual 
cells), to calculate the total edge strength within each cell; To normalize for cell size, 
the total edge strength can be divided by the cell area to give a "microtubule 
ihbrptiology" va^ rmcrbtubule ir^^ arfe associated with strong 

edge strength values and are therefore maximal in cells containing distinct microtubule 

30 stmctures. Likewise, small microtubule morphology values are associated with weak 
edge stfehgtli and are minimal m cells with depolyriierized micrb^ The 
physiological range of microtubule morphology vaJues is set by treating cells with 
edtiier the microtubule s^^ paclitaxel (10 jilyQ or the niicrotubule 

depolymeriahg dnig ndcddaiMle (1 0 fxg/riil). 

35 

2. A classifier to quantify microtubule aggregation into punctate spots or 
foci using methodology from the receptor internalization methods discussed supra. 
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3. A classifier to quantify microtubule ciepolymerization using a measure 
of image texture. 

4. A classifier to quantify apparent interconnectivity, or branching (or 
both), of the microtubules. 

5. Measurement of the kinetics of microtubule reorganization using the 
above classifiers on a time series of images of cells treated with test compounds. 

In a further aspect, kits are provided for analyzing microtubule stability, 
comprising an expression vector comprising a nucleic acid that encodes a microtubule 
labeling protein and instructions for using the expression vector for carrying out the 
methods described above. In a preferred embodiment, the expression vector further 
comprises a nucleic acid that encodes a luminescent protein, wherein the microtubule 
binding protein and the luminescent protein thereof are expressed as a fusion protein. 
Alternatively, the kit may contain an antibody that specifically binds to the 
microtubule-labeling protein. In a fiirther embodiment, the kit includes cells that 
express the microtubule labeling protein. In a preferred embodiment, the cells are 
transfected with the expression vector. In another preferred embodiment, the kits 
further contain a compound that is known to disrupt microtubule structure, including 
but not limited to curacin, nocodazole, vincristine, or vinblastine. In another preferred 
embodiment, the kits further comprise a compound that is known to stabilize 
microtubule structure, including but not limited to taxol (paclitaxel), and 
discodermolide. 

In another aspect, the present invention comprises a machine readable storage 
medium comprising a program containing a set of instructions for causing a cell 
screening system to execute the disclosed methods for analyzing microtubule stability, 
wherein the cell screening system comprises an optical system with a stage adapted for 
holding a plate containing cells, a digital camera, a means for directing fluorescence or 
luminescence emitted fi-om the cells to the digital camera, and a computer means for 
receiving and processing the digital data fi"om the digital camera. 
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High-content screens involving the functional localization of macromolecules 

Within this class of high-content screen, the functional localization of 
macromolecules in response to external stimuli is measured within living cells. 

Glycolytic enzyme activity regulation. In a preferred embodiment of a 
cellular enzyme activity high-content screen, the activity of key glycolytic regulatory 
enzymes are measured in treated cells. To measure enzyme activity, indicator cells 
containing luminescent labehng reagents are treated with test compounds and the 
activity of the reporters is measured in space and time using cell screening system of 
the present invention. 

In one embodiment, the reporter of intracellular enzyme activity is fructose-6- 
phosphate, 2-kinase/fractose-2,6-bisphosphatase (PFK-2), a regulatory enzyme whose 
phosphorylation state indicates intracellular carbohydrate anabolism or catabolism 
(Deprez et al. (1997) J. BioL Chem, 272:17269-17275; Kealer et al. (1996) FEBS 
Letters 395:225-227; Lee et al. (1996), Biochemistry 35:6010-6019). The indicator 
cells contain luminescent reporters consisting of a fluorescent protein biosensor of 
PFK-2 phosphorylation. The fluorescent protein biosensor is constmcted by 
introducing an environmentally sensitive fluorescent dye near to the known 
phosphorylation site of the enzyme (Deprez et al. (1997), supra; Giuliano et al. (1995), 
supra). The dye can be of the ketocyanine class (Kessler and Wolfbeis (1991), 
Spectrochimica Acta 47A:187-192 ) or any class that contains a protein reactive moiety 
and a fluorochrome whose excitation or emission spectrum is sensitive to solution 
polmty. The fluorescent protein biosensor is introduced into the indicator cells using 
bulk loading methodology. 

Living indicator cells are treated with test compounds, at final concentrations 
ranging fi-om 10"'^ M to 10"^ M for times ranging from 0.1 s to 10 h. In a preferred 
embodiment, ratio image data are obtained fi-om living treated indicator cells by 
collecting a spectral pair of fluorescence images at each time point. To extract 
moiphometric data from each time point, a ratio is made between each pair of images 
by numerically dividing the two spectral images at each time point, pixel by pixel. 
Each pixel value is then used to calculate the fi-actional phosphorylation of PFK-2. At 
small fractional values of phosphorylation, PFK-2 stimulates carbohydrate catabolism. 
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At high fractionaJ . values of phosphorylation, PFK-2 stimulates ca^^ 
anabolism. . . . --.^ -y ■ r 

Protein kinas^ A ^ a^^ and localizatioii of subanits. In another 
5 embodiment of a high-cx>ntent .serpen, the domain localizatipn and actiyity of 
protein kinase A (PKA) within indicator cells ^^a^^ measured in response to treatment 
with tes;t comppjm . . , 

The indicator cells contain Imiimescent reporters including a fluorescent protein 
biosensor of PKA activation The fluorescent prote is constracted^ b 

10 introducing an enyironmentally^ sensUiye fluorescent dye into ttie, .catalytic subiinit of 
PKA near the site known to interac >yiA the regulatoiy^^ of I*KA (Harpotum 

et al. (1993), MoL Biol, of the Cell 4:993-1002; Johnson et al. (1996), Cell 85:149-158; 
Giuliano et al. (1995), supra). The dye can be of the ketocyanine class (Kessler, and 
Wolfbeis (1991), Spectrochimica Ac^^^^ 47A:187-192) or any class that contains a 

15 protein reactive moiety and a fluorochrome whose excitatipii or . ei^^^ spectrum is 
sensitive to, solution polmt>^.^^ '^ of BKA actiyati is 

inta;pduced into the indicator celjs using bulk Ipadmg me^ , , . , ^ 

_ In one embodiment, Ir^ indicator ceUs^.are treated with test .cpnipounds, at 
final concentrations^ from 1 OTP M to 1 0'^ M for times ranging from 0. 1 s to ,10 

20 h. In a prefen-ed embodiment^ are obtained living treated 

indicator cells. To extract biosensor data from each time point, a ratio is made between 
each pair of ^ images, and each pixel .value is then used to paJcwlate Jthe fractional 
actiyatipn pf^P {e.^„ separatipn of the catal^ and regulatory sfubxinits after cAMP 
binding)^ At hi^ fractional values of actiyity, PFK-2 stm 

25 withm &e living ^ , o 

To rnea3iu:e the tra^ pf the catalytic subuiiit of PIu^ indicator cells 

containing liraiinescent reporters are j^eated with test cpmppunds and the mpvement, of 
the reporters is nieasured in space and tinie using the cell screenmg system. The 
indicator cells coiit^in lumines^^^ cpnsisting pf domain markers jiised to 

30 measure the localization of the cytoplasmic and nuclear, domains. ..When the indicator 
cells are treated with a test compounds, the dynmiic redistribution pf a PKA 
fluorescent pr^^ intra.cellularly.as a series; of images oyer a 
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time scale ranging from 0.1 s to 10 h. Each image is analyzed by a method that 
quantifies the movement of the PKA between the cytoplasmic and nuclear domains. To 
do this calculation, the images of the probes used to mark the cytoplasmic and nuclear 
domains are used to mask the image of the PKA fluorescent protein biosensor. The 
5 integrated brightness per unit area under each mask is used to form a translocation 
quotient by dividing the cytoplasmic integrated brightness/area by the nuclear 
integrated brightness/area. By comparing the translocation quotient values from 
control and experimental wells, the percent translocation is calculated for each potential 
lead compound. The output of the high-content screen relates quantitative data 
10 describing the magnitude of the translocation within a large number of individual cells 
that have been treated with test compound in the concentration range of 10'^^ M to 10*'* 
M. 

High-content screens involving the induction or inhibition of gene expression 

15 RNA-based fluorescent biosensors 

Cytoskeletal protein transcription and message localization. Regulation of 
the general classes of cell physiological responses including cell-substrate adhesion, 
cell-cell adhesion, signal transduction, cell-cycle events, intermediary and signaling 
molecule metaboUsm, cell locomotion, cell-cell communication, and cell ^eath can 

20 involve the alteration of gene expression. High-content screens can also be designed to 
measure this class of physiological response. 

In one embodiment, the reporter of intracellular gene expression is an 
oligonucleotide that can hybridize with the target mRNA and alter its fluorescence 
signal. In a preferred embodiment, the oligonucleotide is a molecular beacon (Tyagi 

25 and Kramer (1996) Nat BiotechnoL 14:303-308), a luminescence-based reagent whose 
fluorescence signal is dependent on intermolecular and intramolecular interactions. 
The fluorescent biosensor is constructed by introducing a fluorescence energy transfer 
pair of fluorescent dyes such that there is one at each end (5* and 3') of the reagent. 
The dyes can be of any class that contains a protein reactive moiety and fluorpchromes 

30 whose excitation and emission spectra overlap sufficiently to provide fluorescence 
energy transfer between the dyes in the resting state, including, but not Umited to, 
fluorescein and rhodamine (Molecular Probes, Inc.). In a preferred embodiment, a 
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portion of the message coding for P-actin (Kislauskis et al, (1994), J, Cell BioL 
127:441-451; McCann et al. (1997), Proc. Natl Acad, ScL 94:5679-5684; Sutoh 
(1982), Biochemistry 21:3654-3661) is inserted into the loop region of a hairpin-shaped 
.oligonucleotide with the ends tethered together due to intramolecular hybridization. At 
5 each end of the biosensor a fluorescence donor (fluorescein) and a fluorescence 
acceptor (rhodamine) are covalently bound. In the tethered state, the fluorescence 
energy transfer is maximal and therefore indicative of an unhybridized molecule. 
When hybridized with the mRNA coding for p-actin, the tether is broken and energy 
transfer is lost. The complete fluorescent biosensor is introduced into the indicator 

10 cells using bulk loading methodology. 

In one embodiment, living indicator cells are treated with test compounds, at 
final concentrations ranging from 10'^^ M to 10'^ M for times ranging from 0.1 s to 10 
h. In a preferred embodiment, ratio image data are obtained from living treated 
indicator cells. To extract morphometric data from each time point, a ratio is made 

15 between each pair of images, and each pixel value is then used to calculate the 
fractional hybridization of the labeled nucleotide. At small fractional values of 
hybridization little expression of p-actin is indicated. At high fractional values of 
hybridization, maximal expression of P-actin is indicated. Furthermore, the distribution 
of hybridized molecules within the cytoplasrri of the indicator cells is also a measurd'of 

20 the physiological response of the indicator cells. 

Cell surface binding of a ligand 

Labeled insulin binding to its cell surface receptor in living cells. Cells 
whose plasma membrane domain has been labeled with a labeling reagent of a 

25 particular color are incubated with a solution containing insulin molecules (Lee et al. 
(1997), Biochemistry 36:2701-2708; Martinez-Zaguilan et al. (1996), Am. J, Physiol 
270:C1438-C1446) that are labeled with a luminescent probe of a different color for an 
appropriate time under the appropriate conditions. After incubation, unbound insulin 
molecules are washed away, the cells fixed and the distribution and concentration of the 

30 insulin on the plasma membrane is measured. To do this, the cell membrane image is 

used as a mask for the insulin image. The integrated intensity, from the masked insulin 

image is compared to a set of images containing known amounts of labeled insulin. 
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Tlie amount of insulin bound to the cell is determined from the standards and used in 
conjunction with the total concentration of insuHn incubated with the cell to calculate a 
dissociation constant or insulin to its cell surface receptor. 

Labeling of cellular compartments 
Whole cell labeling 

Whole cell labeling is accomplished by labeling cellular components such that, 
dynamics of cell shape and motility of the cell can be measured over time by analyzing 
fluorescence images of cells. 

In one embodiment, small reactive fluorescent molecules are introduced into 
living cells. These membrane-permeant molecules both diffuse through and react with 
protein components in the plasma membrane. Dye molecules react with intracellular 
molecules to both increase the fluorescence signal emitted from each molecule and to 
entrap the fluorescent dye within living cells. These molecules include reactive 
chloromethyl derivatives of aminocoumarins, hydroxycoumarins, eosin diacetate, 
fluorescein diacetate, some Bodipy dye derivatives, and tetramethylrhodamine. The 
reactivity of these dyes toward macromolecules includes free primary amino groups 
and free sulfliydryl groups. 

In another embodiment, the cell surface is labeled by allowing the cell to 
interact with fluorescently labeled antibodies or lectins (Sigma Chemical Company, St. 
Louis, MO) that react specifically with molecules on the cell surface. Cell surface 
protein chimeras expressed by the cell of interest that contain a green fluorescent 
protein, or mutant thereof, component can also be used to fluorescently label the entire 
cell surface. Once the entire cell is labeled, images of the entire cell or cell array can 
become a parameter in high content screens, involving the measurement of cell shape, 
motility, size, and growth and division. 

Plasma membrane labeling 

In one embodiment, labeling the whole plasma membrane employs some of the 
30 same methodology described above for labeling the entire cells. Luminescent 
molecules that label the entire cell surface act to delineate the plasma membrane. 
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In a second embodiment subdomains of the plasma membrane, the extracellular 
surface, the lipid bilayer, and the intracellular surface can be labeled separately and 
used as components of high content screens. In the first embodiment, the extracellular 
surface is labeled using a brief treatment with a reactive fluorescent molecule such as 
5 the succinimidyl ester or iodoacetamde derivatives of fluorescent dyes such as the 
fluoresceins, rhodamines, cyanines, and Bodipys. 

In a third embodiment, the extracellular surface is labeled using fluorescently 
labeled macromolecules with a high affinity for cell surface molecules. These include 
fluorescently labeled lectins such as the fluorescein, rhodamine, and cyanine 
10 derivatives of lectins .. derived from jack bean (Con A), red kidney' bean 
(erythroagglutinin PHA-E), or wheat germ. 

In a fourth embodiment, fluorescently labeled antibodies with a high affinity for 
cell surface components are used to label the extracellular region of the plasma 
membrane. Extracellular \ regions of cell surface receptors and ion chaimels are 
15 examples of proteins that can be labeled with antibodies. 

In a fifth embodiment, the lipid bilayer of the plasma membrane is labeled with 
fluorescent molecules. These molecules include fluorescent dyes attached to long chain 
hydrophobic molecules that interact strongly with the hydrophobic region in the center 
of the plasma membrane lipid bilayer. Examples of these dyes include the PKH series 
20 of dyes (U.S. 4,783,401, 4,762701, and 4,859,584; available commercially from Sigma 
Chemical Company, St, Loius, MO), fluorescent phospholipids such as 
nitrobenzoxadiazole glycerophosphoethanolamine and fluorescein-derivatized 
dihexadecanoylglycerophosphoetha-nolamine, fluorescent fatty acids such as 5-butyl- 
4,4-difluoro-4-bora-3a,4a-diaza-s-indacene-3-nonanoic acid and 1-pyrenedecanoic acid 
25 (Molecular Probes, Inc.), fluorescent sterols including cholesteryl 4,4-difluoro-5,7- 
dimethyl-4-bora-3 a,4a-diaza-s-indacene-3-dodecanoate and cholesteryl 1 - 
pyrenehexanoate, and fluorescently labeled proteins that interact specifically with lipid 
bilayer components such as the fluorescein derivative of annexin V (Caltag Antibody 
Co, Burlingame, CA). 

30 In another embodiment, the intracellular component of the plasma membrane is 

labeled with fluorescent molecules. Examples of these molecules are the intracellular 
components of the trimeric G-protein receptor, adenylyl cyclase, and ionic transport 
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proteins. These molecules can be labeled as a result of tight binding to a fluorescently 
labeled specific antibody or by the incorporation of a fluorescent protein chimera that is 
comprised of a membrane-associated protein and the green fluorescent protein, and 
mutants thereof. 

5 

Endosome fluorescence labeling 

In one embodiment, ligands that are transported into cells by receptor-mediated 
endocytosis are used to trace the dynamics of endosomal organelles. Examples of 
labeled ligands include Bodipy FL-labeled low density lipoprotein complexes, 
10 tetramethylrhodamine transferrin analogs, and fluorescently labeled epidermal growth 
factor (Molecular Probes, Inc.) 

In a second embodiment, fluorescently labeled primary or secondary antibodies 
(Sigma Chemical Co. St. Louis, MO; Molecular Probes, Inc. Eugene, OR; Caltag 
Antibody Co.) that specifically label endosomal ligands are used to mark the 
15 endosomal compartment in cells. 

In a third embodiment, endosomes are fluorescently labeled in cells expressing 
protein chimeras formed by fusing a green fluorescent protein, or mutants thereof, with 
a receptor whose internalization labels endosomes. Chimeras of the EGF, transferrin, 
and low^density lipoprotein receptors are examples of these molecules. 

20 

Lysosome labeling 

In one embodiment, membrane permeant lysosome-specific luminescent 
reagents are used to label the lysosomal compartment of living and fixed cells. These 
reagents include the luminescent molecules neutral red, N-(3-((2,4- 

25 dinitrophenyl)amino)propyl)-N-(3-aminopropyl)methylamine, and the LysoTracker 
probes which report intralysosomal pH as well as the dynamic distribution of 
lysosomes (Molecular Probes, Inc.) 

In a second embodiment, antibodies against lysosomal antigens (Sigma 
Chemical Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to label 

30 lysosomal components that are localized in specific lysosomal domains. Examples of 
these components are the degradative enzymes involved in cholesterol ester hydrolysis, 

82 



BNSDOCID: <WO 0050B7aA2^L> 



wo 00/50872 



PCT/USOO/04794 



membrane protein proteases, and nucleases as well as the ATP-driven lysosomal proton 
pump. 

In a third embodiment, protein chimeras consisting of a lysosomal protein 
genetically fused to an intrinsically luminescent protein such as the green fluorescent 
5 protein, or mutants thereof, are used to label the lysosomal domain. Examples of these 
components are the degradative enzymes involved in cholesterol ester hydrolysis, 
membrane protein proteases, and nucleases as well as the ATP-driven lysosomal proton 
pump. 

10 Cytoplasmic fluorescence labeling 

In one embodiment, cell permeant fluorescent dyes (Molecular Probes, Inc.) 
with a reactive group are reacted with living cells. Reactive dyes including 
monobromobimane, 5-chloromethylfluorescein diacetate, carboxy fluorescein diacetate 
succinimidyl ester, and chlororaethyl tetramethylrhodamine are examples of cell 
15 permeant fluorescent dyes that are used for long term labeling of the cytoplasm of cells. 

In a second embodiment, polar tracer molecules such as Lucifer yellow and 
cascade blue-based fluorescent dyes (Molecular Probes, Inc.) are introduced into cells 
using bulk loading methods and are also used for cytoplasmic labeling. 

In a third embodiment, antibodies against cytoplasmic components (Sigma 
20 Chemical Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to fluorescently 
label the cytoplasm. Examples of cytoplasmic antigens are many of the enzymes 
involved in intermediary metabolism. Enolase, phosphofiuctokinase, and acetyl-CoA 
dehydrogenase are examples of uniformly distributed cytoplasmic antigens. 

In a fourth embodiment, protein chimeras consisting of a cytoplasmic protein 
25 genetically fused to an intrinsically luminescent protein such as the green fluorescent 
protein, or mutants thereof, are used to label the cytoplasm. Fluorescent chimeras of 
uniformly distributed proteins are used to label the entire cytoplasmic domain. 
Examples of these proteins are many of the proteins involved in intermediary 
metabolism and include enolase, lactate dehydrogenase, and hexokinase. 

30 In a fifth embodiment, antibodies against cytoplasmic antigens (Sigma 

Chemical Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to label 

cytoplasmic components that are localized in specific cytoplasmic sub-domains. 
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Examples of these components are the cytoskeletal proteins actin, tubulin, and 
cytokeratin. A population of these proteins within cells is assembled into discrete 
structures, which in this case, are fibrous. Fluorescence labeling of these proteins with 
antibody-based reagents therefore labels a specific sub-domain of the cytoplasm. 
5 In a sixth embodiment, non-antibody-based fluorescently labeled molecules that 

interact strongly with cj^oplasmic proteins are used to label specific cytoplasmic 
components. One example is a fluorescent analog of the enzyme DNAse I (Molecular 
Probes, Inc.) Fluorescent analogs of this en2yme bind tightly and specifically to 
cytoplasmic actin, thus labeling a sub-domain of the cytoplasm. In another example, 
10 fluorescent analogs of the mushroom toxin phalloidin or the drug paclitaxel (Moiecular 
Probes, Inc.) are used to label components of the actin- and microtubule-cytoskeletons, 
respectively. 

In a seventh embodiment, protein chimeras consisting of a cytoplasmic protein 
genetically fused to an intrinsically luminescent protein such as the green fluorescent 
15 protein, or mutants thereof, are used to label specific domains of the cytoplasm. 
Fluorescent chimeras of highly localized proteins are used to label cytoplasmic sub- 
domains. Examples of these proteins are many of the proteins involved in regulating 
the cytoskeleton. They include the structural proteins actin, tubulin, and cytokeratin as 
well as the regulatory proteins microtubule associated protein 4 and a-actinin. 

20 

Nuclear labeling 

In one embodiment, membrane permeant nucleic-acid-specific luminescent 
reagents (Molecular Probes, Inc.) are used to label the nucleus of living and fixed cells. 
These reagents include cyanine-based dyes (e.^., TOTO®, YOYO®, and BOBO^, 
25 phenanthidines and acridines (e.^., ethidium bromide, propidium iodide, and acridine 
orange), indoles and imidazoles {e.g., Hoechst 33258, Hoechst 33342, and 4',6- 
diamidino-2-phenyiindole), and other similar reagents (e.g-., 7-aminoactinomycin D, 
hydroxy stilbamidine, arid the psoralens). 

In a second embodiment, antibodies against nuclear antigens (Sigma Chemical 
30 Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to label nuclear 
components that are localized in specific nuclear domains. Examples of these 
components are the macromolecules involved in maintaining DNA structure and 
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function. DNA, RNA, histones, DNA polymerase, RNA polymerase, lamins, and 
nuclear variants of cytoplasmic proteins such as actin are examples of nuclear antigens. 

In a third embodiment, protein chimeras consisting of a nuclear protein 
genetically fused to an intrinsically luminescent protein such as the green fluorescent 
protein, or mutants thereof, are used to label the nuclear domain. Examples of these 
proteins are many of the proteins involved in maintaining DNA structure and function. 
Histones, DNA polymerase, RNA polymerase, lamins, and nuclear variants of 
cytoplasmic proteins such as actin are examples of nuclear proteins. 

Mitochondrial labeling 

In one embodiment, membrane permeant mitochondrial-specific luminescent 
reagents (Molecular Probes, Inc.) are used to label the mitochondria of living and fixed 
cells. These reagents include rhodamine 123, tetramethyl rosamine, JC-1, and the 
MitoTracker reactiye dyes. 

In a second embodiment, antibodies against mitochondrial antigens (Sigma 
Chemical Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to label 
mitochondrial components that are localized in specific mitochondrial domains. 
Examples of these components are the macromolecules involved in maintaining 
mitochondrial DNA structure and function. DNA, RNA, histones, DNA polyrnerase, 
RNA polymerase, and nfiitochondrial variants of cytoplasmic macromolecules such as 
mitochondrial tRNA and rRNA are examples mitochondrial antigens. Other examples 
of mitochondrial antigens are the components of the oxidative phosphorylation system 
found in the mitochondria (e.g-., cytochrome c, cytochrome c oxidase, and succinate 
dehydrogenase). 

In a third embodiment, protein chimeras consisting of a mitochondrial protein 
genetically fused to an intrinsically luminescent protein such as the green fluorescent 
protein, or mutants thereof, are used to label the mitochondrial domain. Examples of 
these components are the macromolecules involved in maintaining mitochondrial DNA 
structure and fiinction. Examples include histones, DNA polymerase, RNA 
polymerase, and the components of the oxidative phosphorylation system foxmd in the 
mitochondria {e.g.^ cytochrome c, cytochrome c oxidase, and succinate 
dehydrogenase). 
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Endoplasmic reticulum labeling 

In one embodiment, membrane peraieant endoplasmic reticulnm-specific 
luminescent reagents (Molecular Probes, Inc.) are used to label the endoplasmic 
reticulum of living and fixed cells. These reagents include short chain carbocyanine 
5 dyes {e.g.y DiOC6 and DiOC3), long chain carbocyanine dyes (e.g., DilCje and DilCig), 
and luminescently labeled lectins such as concanavalin A. 

In a second embodiment, antibodies against endoplasmic reticulum antigens 
(Sigma Chemical Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to label 
endoplasmic reticulum components that are localized in specific endoplasmic reticulima 
10 domains. Examples of these components are the macromolecules involved in the fatty 
acid elongation systems, glucose-6-phosphatase, and HMG CoA-reductase. 

In a third embodiment, protein chimeras consisting of a endoplasmic reticulum 
protein genetically fiised to an intrinsically luminescent protein such as the green 
fluorescent protein, or mutants thereof, are used to label the endoplasmic reticulimi 
15 domain. Examples of these components are the macromolecules involved in the fatty 
acid elongation systems, glucose-6-phosphatase, and HMG CoA-reductase. 

Golgi labeling 

In one embodiment, membrane permeant Golgi-specific luminescent reagents 
(Molecular Probes, Inc.) are used to label the Golgi of living and fixed cells. These 
20 reagents include luminescently labeled macromolecules such as wheat germ agglutinin 
and Brefeldin A as well as luminescently labeled ceramide. 

In a second embodiment, antibodies against Golgi antigens (Sigma Chemical 
Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to label Golgi components 
that are localized in specific Golgi domains. Examples of these components are N- 
25 acetylglucosamine phosphotransferase, Golgi-specific phosphodiesterase, and 
mannose-6-phosphate receptor protein. 

In a third embodiment, protein chimeras consisting of a Golgi protein 
genetically fiised to an intrinsically luminescent protein such as the green fluorescent 
protein, or mutants thereof, are used to label the Golgi domain. Examples of these 
30 components are N-acetylglucosamine phosphotransferase, Golgi-specific 
phosphodiesterase, and mannose-6-phosphate receptor protein. 
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While many of the examples presented involve the measurement of single 
cellular processes, this is again is intended for purposes of illustration only. Multiple 
parameter high-content screens can be produced by combining several single parameter 
screens into a multiparameter high-content screen or by adding cellular parameters to 
5 any existing high-content screen. Furthermore, while each example is described as 
being based on either live or fixed cells, each high-content screen can be designed to be 
used with both live and fixed cells. 

Those skilled in the art will recognize a wide variety of distinct screens that can 
be developed based on the disclosure provided herein. There is a large and growing list 

10 of known biochemical and molecular processes in cells that involve translocations or 
reorganizations of specific components within cells. The signaling pathway firom the 
cell surface to target sites within the cell involves the translocation of plasma 
membrane-associated proteins to the cytoplasm. For example, it is known that one of 
the src family of protein tyrosine kinases, pp60c-src (Walker et al (1993), J. BioL 

15 Chem. 268:19552-19558) translocates firom the plasma membrane to the cytoplasm 
upon stimulation of fibroblasts with platelet-derived growth factor (PDGF). 
Additionally, the targets for screening can themselves be converted into fluorescence- 
based reagents that report molecular changes including ligand-binding and post- 
translocational modifications. 

20 

Example 10. Protease Biosensors 
(1) Background 

As used herein, the following terms are defined as follows: 

• Reactant - the parent biosensor that interacts with the proteolytic enzyme. 

25 • Product - the signal-containing proteolytic fi-agment(s) generated by the interaction 
of the reactant with the enzyme, 

• Reactant Target Sequence - an amino acid sequence that imparts a restriction on the 
cellular distribution of the reactant to a particular subcellular domain of the cell. 

• Product Target Seouence - an amino acid sequence that imparts a restriction on the 
30 cellular distribution of the signal-containing product(s) of the targeted enzymatic 

reaction to a particular subcellular domain of the celL If the product is initially 
localized within a membrane bound compartment, then the Product Target 
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Sequence must incorporate the ability to export the product out of the membrane- 
bound compartment. A bi-fiinctional sequence can be used, which first moves the 
product out of the membrane-bound compartment, and then targets the product to 
the final compartment. In general, the same amino acid sequences can act as either 
or both reactant target sequences and product target sequences. Exceptions to this 
include amino acid sequences which target the nuclear envelope, Golgi apparatus, 
endoplasmic reticuulum, and which are involved in famesylation, which are more 
suitable as reactant target sequences. 

• Protease Recognition Site - an amino acid sequence that imparts specificity by 
mimicking the substrate, providing a specific binding and cleavage site for a 
protease. Although typically a short sequence of amino acids representing the 
minimal cleavage site for a protease (e.g. DEVD for caspase-3. Villa, P., S.H. 
Kaufmann, and W.C. Eamshaw. 1997. Caspases and caspase inhibitors. Trends 
Biochem ScL 22:388-93), greater specificity may be established by using a longer 
sequence from an established substrate. 

• Compartment - any cellular sub-structure or macromolecular component of the cell, 
whether it is made of protein, lipid, carbohydrate, or nucleic acid. It could be a 
macromolecular assembly or an organelle (a membrane delimited cellular 
component). Compartments include, but are not limited to, cytoplasm, nucleus, 
nucleolus, inner and outer surface of nuclear envelope, cytoskeleton, peroxisome, 
endosome, lysosome, inner leaflet of plasma membrane, outer leaflet of plasma 
membrane, outer leaflet of mitochondrial membrane, inner leaflet of mitochondrial 
membrane, Golgi, endoplasmic reticulum, or extracellular space. 

Signal - an amino acid sequence that can be detected. This includes, but is not 
limited to inherently fluorescent proteins (e.g. Green Fluorescent Protein), cofactor- 
reqiiiring fluorescent or luminescent proteins (e.g. phycobiliproteins or luciferases), 
and epitopes recognizable by specific antibodies or other specific natural or 
unnatural binding probes, including biit not limited to dyes, enzyme cofactors and 
engineered binding molecules, which are fluorescently or lummescently labeled. 
Also included are site-specifically labeled proteins that contain a luminescent dye. 
Methodology for site-specific labeling of proteins includes, but is not limited to, 
engineered dye-reactive amino acids (Post, et al., J. Biol Chem,, 269:12880-12887 
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(1994)), enzyme-based incorporation of luminescent substrates into proteins 
(Buckler, et al„ Analyt. Biochem. 209:20-31 (1993); Takashi, Biochemistry. 
27:938-943 (1988)), and the incorporation of unnatural labeled amino acids into 
proteins (Noren, et al.. Science. 244:182-188 (1989)). 
• Detection - a means for recording the presence, position, or amoimt of the signal. 
The approach may be direct, if the signal is inherently fluorescent, or indirect, if, for 
example, the signal is an epitope that must be subsequently detected with a labeled 
antibody. Modes of detection include, but are not limited to, the spatial position of 
fluorescence, luminescence, or phosphorescence: (1) intensity; (2) polarization; (3) 
lifetime; (4) wavelength; (5) energy transfer; and (6) recovery after photobleaching. 
The basic principle of the protease biosensors of the present invention is to 
spatially separate the reactants from the products generated during a proteolytic 
reaction. The separation of products from reactants occurs upon proteolytic cleavage of 
the protease recognition site v/ithin the biosensor, allowing the products to bind to, 
diffuse into, or be imported into compartments of the cell different from those of the 
reactant. This spatial separation provides a means of quantitating a proteolytic process 
directly in living or fixed cells. Some designs of the biosensor provide a means of 
restricting the reactant (uncleaved biosensor) to a particular compartment by a protein 
sequence ("reactant target sequence") that binds to or imports the biosensor into a 
compartment of the cell. These compartments include, but are not limited to any 
cellular substructure, macromolecular cellular component, membrane-limited 
organelles, or the extracellular space. Given that the characteristics of the proteolytic 
reaction are related to product concentration divided by the reactant concentration, the 
spatial separation of products and reactants provides a means of uniquely quantitating 
products and reactants in single cells, allowing a more direct measure of proteolytic 
activity. 

The molecular-based biosensors may be introduced into cells via transfection 
and the expressed chimeric proteins analyzed in transient cell populations or stable cell 
lines. They may also be pre-formed, for example by production in a prokaryotic or 
eukaryotic expression system, and the purified protein introduced into the cell via a 
number of physical mechanisms including, but not limited to, micro-injection, scrape 
loading, electroporation, signal-sequence mediated loading, etc. 
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Measurement modes may include, but are not limited to, the ratio or difference 
in fluorescence, luminescence, or phosphorescence: (a) intensity; (b) polarization; or (c) 
lifetime between reactant and product These latter modes require appropriate 
spectroscopic differences between products and reactants. For example, cleaving a 
5 reactant containing a limited-mobile signal into a very small translocating component 
and a relatively large non-translocating component may be detected by polarization. 
Alternatively, significantly different emission lifetitnes between reactants and products 
allow detection in imaging and non-imaging modes. 

One example of a family of enzymes for which this biosensor can be 

10 constructed to report activity is the caspases. Caspases are a class of proteins that 
catalyze proteolytic cleavage of a wide variety of targets during apoptosis. Following 
initiation of apoptosis, the Class n "downstream" caspases are activated and are the 
point of no return in the pathway leading to cell death, resulting in cleavage of 
downstream target proteins. In specific examples, the biosensors described here were 

15 engineered to use nuclear translocation of cleaved GFP as a measurable indicator of 
caspase activation. Additionally, the use of specific recognition sequences that 
incorporate surrounding amino acids involved in secondary structure formation in 
naturally occurring proteins may increase the specificity and sensitivity of this class of 
biosensor. 

20 Another example of a protease class for which this biosensor can be constructed 

to report activity is zinc metalloproteases. Two specific examples of this class are the 
biological toxins derived from Clostridial species (C. botulinum and C. tetani) and 
Bacillus anthracis. (Herreros et al. In The Comprehensive Sourcebook of Bacterial 
Protein Toxins. J.E. Alouf and J.H. Freer, Eds. 2"** edition, San Diego, Academic Press, 

25 1999; pp 202-228.) These bacteria express and secrete zinc metalloproteases that enter 
eukaryotic cells and specifically cleave distinct target proteins. For example, the 
anthrax protease from Bacillus anthracis is delivered into the cytoplasm of target cells 
via an accessory pore-forming protein, where its proteolytic activity inactivates the 
MAP-kinase signaling cascade through cleavage of mitogen activated protein kinase 

30 kinases 1 or 2 (MEKl or MEK2). (Leppla, S.A. In The Comprehensive Sourcebook of 
Bacterial Protein Toxins. J.E. Alouf and J.H. Freer, Eds. 2"^* edition, San Diego, 
Academic Press, 1999; pp243-263.) The toxin biosensors described here take 
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advantage of the natural subcellular localization of these and other target proteins to 
achieve reactant targeting. Upon cleavage, the signal (with or without a product target 
sequence) is separated from the reactant to create a high-content biosensor. 

One of skill in the art will recognize that the protein biosensors of this aspect of 
5 the invention can be adapted to report the activity of any member of the caspase family 
of proteases, as well as any other protease, by a substitution of the appropriate protease 
recognition site in any of the constructs (see Figure 29B). These biosensors can be 
used in high-content screens to detect in vivo activation of enzymatic activity and to 
identify specific activity based on cleavage of a known recognition motif. This screen 
10 can be used for both live cell and fixed end-point assays, and can be combined with 
additional measurements to provide a multi-parameter assay. 

Thus, in another aspect the present invention provides recombinant nucleic acids 
encoding a protease biosensor, comprising: 

a. a first nucleic acid sequence that encodes at least one detectable 
15 polypeptide signal; 

b. a second nucleic acid sequence that encodes at least one protease 
recognition site, wherein the second nucleic acid sequence is operatively linked to the 
first nucleic acid sequence that encodes the at least one detectable polypeptide signal; 
and 

20 c. a third nucleic acid sequence that encodes at least one reactant target 

sequence, wherein the third nucleic acid sequence is operatively linked to the second 
nucleic acid sequence that encodes the at least one protease recognition site. 

In this aspect, the first and third nucleic acid sequences are separated by the 
25 second nucleic acid sequence, which encodes the protease recognition site. 

In a further embodiment, the recombinant nucleic acid encoding a protease 
biosensor comprises a fourth nucleic acid sequence that encodes at least one product 
target sequence, wherein the fourth nucleic acid sequence is operatively linked to the 
first nucleic acid sequence that encodes the at least one detectable polypeptide signal. 
30 In a further embodiment, the recombinant nucleic acid encoding a protease 

biosensor comprises a fifth nucleic acid sequence that encodes at least one detectable 
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polypeptide signal, wherein the fifth nucleic acid sequence is operatively linked to the 
third nucleic acid sequence that encodes the reactant target sequence. 

In a preferred embodiment, the detectable polypeptide signal is selected from 
the group consisting of fluorescent proteins, luminescent proteins, and sequence 
5 epitopes. In a most preferred embodiment, the first nucleic acid encoding a polypeptide 
sequence comprises a sequence selected from the group consisting of SEQ ID NOS: 35, 
37, 39, 41, 43, 45, 47, 49, and 51. 

In another preferred embodiment, the second nucleic acid encoding a protease 
recognition site comprises a sequence selected fi-om the group consisting of SEQ ID 

10 NOS: 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 
95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, and 121. In another 
preferred embodiment, the third nucleic acid encoding a reactant target sequence 
comprises a sequence selected from the group consisting of SEQ ID NOS: 123, 125, 
127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, and 151. 

15 In a most preferred embodiment, the recombinant nucleic acid encoding a 

protease biosensor comprises a sequence substantially similar to sequences selected 
fi-om the group consisting of SEQ ID N0S:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 
27, 29, 31, and 33. 

In another aspect, the present invention provides a recombinant expression 
20 vector comprising nucleic acid control sequences operatively linked to the above- 
described recombinant nucleic acids. In a still further aspect, the present invention 
provides genetically engineered host cells that have been transfected with the 
recombinant expression vectors of the invention. 

In another aspect, the present invention provides recombinant protease 
25 biosensors comprising 

a. a first domain comprising at least one detectable polypeptide 

signal; 

b. a second domain comprising at least one protease recognition 

site; and 

30 c, a third domain comprising at least one reactant target sequence; 

wherein the first domain and the third domain are separated by the 
second domain. 
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Inherent in this embodiment is the concept that the reactant target sequence 
restricts the cellular distribution of the reactant, with redistribution of the product 
occurring after activation (ie: protease cleavage). This redistribution does not require a 
complete sequestration of products and reactants, as the product distribution can 
5 partially overlap the reactant distribution in the absence of a product targeting signal 
(see below). 

In a preferred embodiment, the recombinant protease biosensor further 
comprises a fourth domain comprising at least one product target sequence, wherein the 
fourth domain and the first domain are operatively linked and are separated from the 

10 third domain by the second domain. In another embodiment, the recombinant protease 
biosensor further comprises a fifth domain comprising at least one detectable 
polypeptide signal, wherein the fifth domain and the third domain are operatively 
linked and are separated from the first domain by the second domain. 

In a preferred embodiment, the detectable polypeptide signal domain (first or 

15 fifth domain) is selected fi-om the group consisting of fluorescent proteins, luminescent 
proteins, and sequence epitopes. In a most preferred embodiment, the detectable 
polypeptide signal domain comprises a sequence selected fi-om the group consisting of 
SEQ ID NOS:36, 38, 40, 42, 44, 46, 48, 50, and 52. 

In another preferred embodiment, the second domain comprising a protease 

20 recognition site comprises a sequence selected from the group consisting of SEQ ID 
NOS:54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 
98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, and 122. In another 
preferred embodiment, the reactant and/or target sequence domains comprise a 
sequence selected from the group consisting of SEQ ID NOS:124, 126, 128, 130, 132, 

25 134, 136, 138, 140, 142, 144, 146, 148, 150, and 152.. 

In a most preferred embodiment, the recombinant protease biosensor comprises 
a sequence substantially similar to sequences selected from the group consisting of 
SEQ ID NO:2, 4, 6\ 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, and 34. 

In a still further embodiment, the present invention provides methods and kits 

30 for automated analysis of cells, comprising using cells that possess the protease 
biosensors of the invention to identify compounds that affect protease activity. The 
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method can be combined with the other methods of the invention in a variety of 
possible multi-parametric assays. 

In these various embodiments, the basic protease biosensor is composed of 
multiple domains, including at least a first detectable polypeptide signal domain, at 
5 least one reactant target domain, and at least one protease recognition domain, wherein 
the detectable signal domain and the reactant target domain are separated by the 
protease recognition domain. Thus, the exact order of the domains in the molecule is 
not generally critical, so long as the protease recognition domain separates the reactant 
target and first detectable signal domain. For each domain, one or more one of the 

10 specified recognition sequences is present 

In some cases, the order of the domains in the biosensor may be critical for 
appropriate targeting of product(s) and/or reactant to the appropriate cellular 
compartment(s). For example, the targeting of products or reactants .to the peroxisome 
requires that the peroxisomal targeting domain comprise the last three amino acids of 

15 the protein. Determination of those biosensor in which the relative placement of 
targeting domains within the biosensor is critical can be determined by one of skill in 
the art through routine experimentation. 

Some examples of the basic organization of domains within the protease 
biosensor are shown in Figure 30. One of skill in the art will recognize that any one of 

20 a wide variety of protease recognition sites, product target sequences, polypeptide 
signals, and/or product target sequences can be used in various combinations in the 
protein biosensor of the present invention, by substituting the appropriate coding 
sequences into the multi-domain construct. Non-limiting examples of such alternative 
sequences are shown in Figure 29A-29C. Similarly, one of skill in the art will 

25 recognize that modifications, substitutions, and deletions can be made to the coding 
sequences and the amino acid sequence of each individual domain within the biosensor, 
while retaining the fimction of the domain. Such various combinations of domains and 
modifications, substitutions and deletions to individual domains are within the scope of 
the invention. 

30 As used herein, the temi "coding sequence" or a sequence which "encodes" a 

particular polypeptide sequence, refers to a nucleic acid sequence which is transcribed 
(in the case of DNA) and translated (in the case of mRNA) into a polypeptide in vitro 
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or in vivo when placed under the control of appropriate regulatory sequences. The 
boundaries of the coding sequence are determined by a start codon at the 5' (amino) 
terminus and a translation stop codon at the 3* (carboxy) terminus. A coding sequence 
can include, but is not limited to, cDNA from prokaryotic or eukaryotic mRNA, 
5 genomic DNA sequences from prokaryotic or eukaryotic DNA, and synthetic DNA 
sequences. A transcription termination sequence will usually be located 3' to the coding 
sequence. 

As used herein, the term DNA "control sequences" refers collectively to 
promoter sequences, ribosome binding sites, polyadenylation signals, transcription 
10 termination sequences, upstream regulatory domains, enhancers, and the like, which 
collectively provide for the transcription and translation of a coding sequence in a host 
cell. Not all of these control sequences need always be present in a recombinant vector 
so long as the DNA sequence of interest is capable of being transcribed and translated 
appropriately. 

15 As used herein, the term "operatively linked" refers to an arrangement of 

elements wherein the components so described are configured so as to perform their 
usual fiinction. Thus, control sequences operatively linked to a coding sequence are 
capable of effecting the expression of the coding sequence. The control sequences need 
not be contiguous with the coding sequence, so long as they function to direct ^e 

20 expression thereof. Thus, for example, intervening untranslated yet transcribed 
sequences can be present between a promoter sequence and the coding sequence and 
the promoter sequence can still be considered "operatively linked" to the coding 
sequence. 

Furthermore, a nucleic acid coding sequence is operatively linked to another 
25 nucleic acid coding sequences when the coding region for both nucleic acid molecules 
are capable of expression in the same reading frame. The nucleic acid sequences need 
not be contiguous, so long as they are capable of expression in the same reading frame. 
Thus, for example, intervening coding regions can be present between the specified 
nucleic acid coding sequences, and the specified nucleic acid coding regions can still be 
30 considered "operatively linked". 

The intervening coding sequences between the various domains of the 
biosensors can be of any length so long as the flmction of each domain is retained, 
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Generally, this requires that the two-dimensional and three-dimensional structure of the 
intervening protein sequence does not preclude the binding or interaction requirements 
of the domains of the biosensor, such as product or reactant targeting, binding of the 
protease of interest to the biosensor, fluorescence or luminescence of the detectable 
5 polypeptide signal, or binding of fluorescently labeled epitope-specific antibodies. 

One case where the distance between domains of the protease biosensor is 
important is where the goal is to create a fluorescence resonance energy transfer pair. In 
this case, the FRET signal will only exist if the distance between the donor and 
acceptor is sufficiently small as to allow energy transfer (Tsien, Heim and Cubbit, WO 
10 97/28261). The average distance between the donor and acceptor moieties should be 
between 1 nm and' 10 nm with a preference of between 1 nm and 6 nm. This is the 
physical distance between donor and acceptor. The intervening sequence length can 
vary considerably since the three dimensional structure of the peptide will determine 
the physical distance between donor and acceptor. 
15 "Recombinant expression vector" includes vectors that operatively link a 

nucleic acid coding region or gene to any promoter capable of effecting expression of 
the gene product. The promoter sequence used to drive expression of the protease 
biosensor may be constitutive (driven by any of a variety of promoters, including but 
not limited to, CMV, SV40, RSV, actin, EF) or inducible (driven by any of a number of 
20 inducible promoters including, but not limited to, tetracycline, ecdysone, steroid- 
responsive). The expression vector must be replicable in the host organisms either as 
an episome or by integration into host chromosomal DNA. hi a preferred embodiment, 
the expression vector comprises a plasmid. However, the invention is intended to 
include any other suitable expression vectors, such as viral vectors. 
25 The phrase "substantially similar " is used herein in reference to the nucleotide 

sequence of DNA, or the amino acid sequence of protein, having one or more 
conservative or non-conservative variations from the protease biosensor sequences 
disclosed herein, including but not limited to deletions, additions, or substitutions 
wherein the resulting nucleic acid and/or amino acid sequence is functionally 
30 equivalent to the sequences disclosed and claimed herein. Functionally equivalent 
sequences will function in substantially the same manner to produce substantially the 
same protease biosensor as the nucleic acid and amino acid compositions disclosed and 
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claimed herein. For example, functionally equivalent DNAs encode protease 
biosensors that are the same as those disclosed herein or that have one or more 
conservative amino acid variations, such as substitutions of non-polar residues for other 
non-polar residues or charged residues for similarly charged residues, or addition 
5 to/deletion from regions of the protease biosensor not critical for functionality. These 
changes include those recognized by those of skill in the art as substitutions, deletions, 
and/or additions that do not substantially alter the tertiary stmcture of the protein. 

As used herein, substantially similar sequences of nucleotides or amino acids 
share at least about 70%-75% identity, more preferably 80-85% identity, and most 

10 preferably 90-95% identity. It is recognized, however, that proteins (and DNA or 
mRNA encoding such proteins) containing less than the above-described level of 
homology (due to the degeneracy of the genetic code) or that are modified by 
conservative amino acid substitutions (or substitution of degenerate codons) are 
contemplated to be within the scope of the present invention. 

15 The temi "heterologous" as it relates to nucleic acid sequences such as coding 

sequences and control sequences, denotes sequences that are not normally associated 
with a region of a recombinant constmct, and/or are not normally associated with a 
particular cell. Thus, a "heterologous" region of a nucleic acid constmct is an 
identifiable segment of nucleic acid wifliin or attached to another nucleic acid molecule 

20 that is not found in association with the other molecule in nature. For example, a 
heterologous region of a constmct could include a coding sequence flanked by 
sequences not found in association with the coding sequence in nature. Another 
example of a heterologous coding sequence is a constmct where the coding sequence 
itself is not found in nature (e.g., synthetic sequences having codons different firom the 

25 native gene). Similarly, a host cell transformed with a constmct which is not nomially 
present in the host cell would be considered heterologous for purposes of this invention. 

Within this application, unless otherwise stated, the techniques utilized may be 
found in any of several well-known references such as: Molecular Cloning: A 
Laboratory Manual (Sambrook, et al, 1989, Cold Spring Harbor Laboratory Press), 

30 Gene Expression Technology (Methods in Enzymology, Vol. 185, edited by D. 
Goeddel, 1991. Academic Press, San Diego, CA), "Guide to Protein Purification" in 
Methods in Enzymology (M.P. Deutshcer, ed., (1990) Academic Press, Inc.); PCR 
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Protocols: A Guide to Methods and Applications (Innis, et al. 1990. Academic Press, 
San Diego, CA), Culture of Animal Cells: A Manual of Basic Technique, 2"^ Ed. (R.I, 
Freshney. 1987. Liss, Inc. New York, NY), Gene Transfer and Expression Protocols^ 
pp. 109-128, ed. E.J. Murray, The Humana Press Inc., Clifton, NJ.), and the Ambion 
5 1 998 Catalog (Ambion, Austin, TX). 

The biosensors of the present invention are constructed and used to transfect 
host cells using standard techniques in the molecular biological arts. Any number of 
such techniques, all of which are within the scope of this invention, can be used to 
generate protease biosensor-encoding DNA constructs and genetically transfected host 
10 cells expressing the biosensors. The non-limiting examples that follow demonstrate 
one such technique for constructing the biosensors of the invention. 

EXAMPLE OF PROTEASE BIOSENSOR CONSTRUCTION AND USE: 

In the following examples, caspase-specific biosensors with specific product 

15 target sequences have been constructed using sets of 4 primers (2 sense and 2 
antisense). These primers have overlap regions at their termini, and are used for PCR 
via a primer walking technique. (Sambrook, J., Fritsch, E.F. and Maniatis, T. (1989 ) 
Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York) The two sense primers were chosen to start from the 5' 

20 polylinker (Bspl) of the GFP-containing vector (Clontech, California) to the middle of 
the designed biosensor sequence. The two antisense primers start from a 3' GFP vector 
site (Bam HI), and overlap with the sense primers by 12 nucleotides in the middle. 

PCR conditions were as follows: 94*^C for 30 seconds for denaturation, 55 °C for 
30 seconds for aimealing, and 72°C for 30 seconds for extension for 15 cycles. The 

25 primers have restriction endonuclease sites at both ends, facilitating subsequent cloning 
of the resulting PCR product. 

The resulting PCR product was gel purified, cleaved at BspEl and BamHl 
restriction sites present in the primers, and the resulting fragment was gel purified. 
Similarly, the GFP vector (Clontech, San Francisco, CA) was digested at BspEl and 

30 BamHl sites in the polylinker. Ligation of the GFP vector and the PCR product was 
performed using standard techniques at 16°C overnight. E. coli cells were transfected 
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with the ligation mixtures using standard techniques. Transformed cells were selected 
on LB-agar with an appropriate antibiotic. 

Cells and transfections. For DNA transfection, BHK cells and MCF-7 cells 
5 were cultured to 50-70% confluence in 6 well plates containing 3 ml of minimal 
Eagle's medium (MEM) with 10% fetal calf serum, 1 mM L-glutamine, 50 jag/ml 
streptomycin, 50 |-ig/ml penicillin, 0.1 mM non-essential amino acids, 1 mM sodium 
pyruvate and 10 M-g/ml of bovine insulin (for MCF-7 cell only) at 37 ^'C in a 5% CO2 
incubator for about 36 hours. The cells were washed with serum free MEM media and 

10 incubated for 5 hours with 1 ml of transfection mixture containing 1 jig of the 
appropriate plasmid and 4 ^ig of lipofectimine (BRL) in the semm free MEM media. 
Subsequently, the transfection medium was removed and replaced with 3 ml of normal 
culture media. The transfected cells were maintained in growth medium for at least 16 
hours before performing selection of the stable cells based on standard molecular 

15 biology methods (Ausubel. et al 1995). 

Apoptosis assay. For apoptosis assays, the cells (BHK, MCF-7) stably 
transfected with the appropriate protease biosensor expression vector were plated on 
tissue culture treated 96-well plates at 50-60% confluence and cultured overnight at 
20 37°C, 5% CO2. Varying concentrations of cis-platin, staurosporine, or paclitaxel in 
normal cultiu-e media were freshly prepared from stock and added to cell culture dishes 
to replace the old culture media. The cells were then observed with the cell screening 
system of the present invention at the indicated time points either as live cell 
experiments or as fixed end-point experiments. 

25 

1. Construction of 3-domain protease biosensors 

a. Caspase-3 biosensor with an annexin n reactant targeting domain 
(pljkGFP), 

The design of this biosensor is outlined in Figure 31, and its sequence is shovm 
30 in SEQ ID NO:l and 2. 
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Primers for Caspase 3, Product target sequence = none (CP3GFP-CYTO): 

1) TCA TCA TCC GGA GCT GGA GCC GGA GCT GGC CGA TCG GCT GTT 
AAA TCT GAA GGA AAG AGA AAG TGT GAG GAA GTT GAT GGA ATT 
GAT GAA GTA GGA (SEQ ID NO:153) 

2) GAA GAA GGA TCC GGC ACT TGG GGG TGT AGA ATG AAC ACC 
CTC CAA GCT GAG CTT GCA CAG GAT TTC GTG GAC AGT AGA 
CAT AGT ACT TGC TAC TTC ATC (SEQ ID NO:154) 

3) TCA TCA TCC GGA GCT GGA (SEQ ID NO:155) 

4) GAA GAA GGA TCC GGC ACT (SEQ ID NO:156) 

This biosensor is restricted to the cytoplasm by the reactant target sequence. 
The reactant target sequence is the annexin n cytoskeletal binding domain 
(MSTVHEILCKLSLEGVHSTPPSA) (SEQ ID NO:124) (Figure 29C) (Eberhard et 
15 al. 1997. MoL Biol Cell 8:293a). The enzyme recognition site corresponds to two 
copies of the amino acid sequence DEVD (SEQ ID NO:60) (Figure 29B), which 
serves as the recognition site of caspase-3. Other examples with different nimibers of 
protease recognition sites and/or additional amino acids from a naturally occurring 
protease recognition site are shown below. The signal domain is EGFP (SEQ ID 
20 NO:46) (Figure 29A) (Clontech, California). The parent biosensor (the reactant) is 
restricted to the cytoplasm by binding of the annexin 11 domain to the cytoskeleton, and 
is therefore excluded from the nucleus. Upon cleavage of the protease recognition site 
by caspase 3, the signal domain (EGFP) is released from the reactant targeting domain 
(annexin 11), and is distributed throughout the whole volume of the cell, because it lacks 
25 any specific targeting sequence and is small enough to enter the nucleus passively. 
(Fig 32) 

The biosensor response is measured by quantitating the effective cytoplasm-to- 
nuclear translocation of the signal (see above). Measurement of the response is by one 
of several modes, including integrated or average nuclear region intensity, the ratio or 
30 difference of the integrated or average cytoplasm intensity to integrated or average 
nuclear intensity. The nucleus is defined using a DNA-specific dye, such as Hoechst 
33342. 
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This biosensor provides a measure of the proteolytic activity around the annexin 
II cytoskeleton binding sites within the cell. Given the dispersed nature of the 
cytoskeleton and the effectively diffuse state of cytosolic enzymes, this provides an 
effective measure of the cytoplasm in general. 

5 

Results & Discussion: 

Fig 32 illustrates images before and after stimulation of apoptosis by cis-platin 
in BHK cells, transfected with the caspase 3 biosensor. The images clearly illustrate 
accumulation of fluorescence in the nucleus. Generation of the spatial change in 

10 fluorescence is non-reversible and thus the timing of the assay is flexible. Controls for 
this biosensor include using a version in which the caspase-3-specific site has been 
omitted. In addition, dismption of the cytoskeleton with subsequent cell rounding did 
not produce the change in fluorescence distribution. Our experiments demonstrate the 
correlation of nuclear condensation with activation of caspase activity. We have also 

15 tested this biosensor in MCF-7 cells. A recent report measured a peak response in 
caspase-3 activity 6 h after stimulation of MCF-7 cells with etoposide accompanied by 
cleavage of PARP (Benjamin et al. 199BMol Pharmacol. 53:446-50). However, 
another recent report foimd that MCF-7 cells do not possess caspase-3 activity and, in 
fact, the caspase-3 gene is functionally deleted (Janicke et al. 1998. J Biol Chem, 

20 273:9357-60). Caspase-3 activity was not detected with the caspase biosensor in MCF- 
7 cells after a 15 h treatment with 100 ]iM etoposide. 

Janicke et al., (1998) also indicated that many of the conventional substrates of 
caspase-3 were cleaved in MCF-7 cells upon treatment with staurosporine. Our 
experiments demonstrate that caspase activity can be measured using the biosensor in 

25 MCF-7 cells when treated with staurosporine. The maximum magnitude of the 
activation by staurosporine was approximately one-half that demonstrated with cis- 
platin in BHK cells. This also implies that the current biosensor, although designed to 
be caspase-3 -specific, is indeed specific for a class of caspases rather than uniquely 
specific for caspase-3. The most likely candidate is caspase-7 (Janicke et al., 1998). 

30 These experiments also demonstrated that the biosensor can be used in multiparameter 
experiments, with the correlation of decreases in mitochondrial membrane potential, 
nuclear condensation, and caspase activation. 
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We have specifically tested the effects of paclitaxel on caspase activation using 
the biosensor. Caspase activity in BHK and MCF-7 cells was stimulated by paclitaxel. 
It also appears that caspase activation occurred after nuclear morphology changes. One 
caveat is that, based on the above discussions, the caspase activity reported by the 
5 biosensor in this assay is likely to be due to the combination of caspase-3 and, at least, 
caspase-7 activity. 

Consistent with the above results using staurosporine stimulation on MCF-7 
cells, paclitaxel also stimulated the activation of caspase activity. The magnitude was 
similar to that of staurosporine. This experiment used a much narrower range of 
10 paclitaxel than previous experiments where nuclear condensation appears to dominate 
the response. 

b. Caspase biosensor with the microtubule associated protein 4 
(MAP4) projection domain (CP8GFPNLS-SIZEPROJ) 

15 Another approach for restricting the reactant to the cytoplasm is to make the 

biosensor too large to penetmte the nuclear pores Cleavage of such a biosensor 
liberates a product capable of diffusing into the nucleus. 

The additional size required for this biosensor is provided by using the 
projection domain of MAP4 (SEQ ID NO:142) (Figure 29C) (CP8GFPNLS- 

20 SIZEPROJ). The projection domain of MAP4 does not interact with microtubules on 
its own, and, when expressed, is diffusely distributed throughout the cytoplasm, but is 
excluded from the nucleus due to its size (-120 kD). Thus, this biosensor is distinct 
from the one using the fiiU length MAP4 sequence, (see below) One of skill in the art 
will recognize that many other such domains could be substituted for the MAP4 

25 projection domain, including but not limited to multiple copies of any GFP or one or 
more copies of any other protein that lacks an active NLS and exceeds the maximum 
size for diffusion into the nucleus (approximately 60 kD; Alberts, B., Bray, D., Raff, 
M., Roberts^ K., Watson, J.D. (Eds.) Molecular Biology of the Cell , third edition. New 
York: Garland publishing, 1994. pp 561-563). The complete sequence of the resulting 

30 biosensor is shown in SEQ ID NO: 3-4. A similar biosensor with a dififerent protease 
recognition domain is shown in SEQ ID NO:S-6. 
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c Caspase biosensor with a nuclear export signal 

Another approach for restricting the reactant to the cytoplasm is to actively 
restrict the reactant from the nucleus by using a nuclear export signal. Cleavage of 
such a biosensor liberates a product capable of diffusing into the nucleus. 
5 The Bacillus anthracis bacterium expresses a zinc metalloprotease protein 

complex called anthrax protease. Human mitogen activated protein kinase kinase 1 
(MEK 1) (Seger et al., J. Biol. Chem. 267:25628-25631, 1992) possesses an anthrax 
protease recognition site (amino acids 1-13) (SEQ ID NO: 102) (Figure 29B) that is 
cleaved after amino acid 8, as well as a nuclear export signal at amino acids 32-44 

10 (SEQ ID NO:140) (Figure 29C). Human MEK 2 (Zheng and Guan, J. Biol. Chem. 
268:11435-11439, 1993) possesses an anthrax protease recognition site comprising 
amino acid residues 1-16 (SEQ ID NO:104) (Figure 29B) and a nuclear export signal 
at amino acids 36-48. (SEQ ID NO:148) (Figure 29Q. 

The anthrax protease biosensor comprises Fret25 (SEQ ID NO:48) (Figure 

15 29A) as the signal, the anthrax protease recognition site, and the nuclear export signal 
from MEK 1 or MEK2. (SEQ ID NOS: 7-8 (MEKl); 9-10 (MEK2)) The intact 
biosensor will be retained in the cytoplasm by virture of this nuclear export signal (eg., 
the reactant target site). Upon cleavage of the fusion protein by anthrax protease, the 
NES will be separated from the GFP allowing the GFP to diffuse into the nucleus. 

20 

2. Construction of 4- and 5-domain biosensors 

For all of the examples presented above for 3-domain protease biosensors, a 
product targeting sequence, including but not limited to those in Figure 29C, such as a 
nuclear localization sequence (NLS), can be operatively linked to the signal sequence, 
25 and thus cause the signal sequence to segregate from the reactant target domain after 
proteolytic cleavage. Addition of a second detectable signal domain, including but not 
limited to those in Figure 29A, operatively linked with the reactant target domain is 
also useful in allowing measurement of the reaction by multiple means. Specific 
examples of such biosensors are presented below. 

30 

a* 4 domain biosensors 

1. Caspase biosensors with nuclear localization sequences 
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(pcas3iilsGFP; CP3GFPNLS-CYTO): 

The design of the biosensor is outlined in Figure 33, and its sequence is shown 
in SEQ ID NO:ll-12. PGR and cloning procedures were performed as described 
above, except that the following oligonucleotides were used: 
5 Primers for Caspase 3, Product target sequence = NLS (CP3GFPNLS-CYTO) : 

1) TCA TCA TCC GGA AGA AGO AAA CGA CAA AAG CGATCGGCT 
GTT AAA TCT GAA GGA AAG AGA AAG TGT GAG GAA GTT GAT GGA 
ATT GAT GAA GTA GGA (SEQ ID NO:157) 
10 2) GAA GAA GGA TCC GGC ACT TGG GGG TGT AGA ATG AAC ACC 
CTC CAA GCT GAG CTT GCA CAG GAT TTC GTG GAC AGT AGA 
CAT AGT ACT TGC TAG TTC ATC (SEQ ID NO: 154) 

3 ) TCA TCA TCC GGA AGA AGG (SEQ ID NO:158) 

4 ) GAA GAA GGA TCC GGC ACT (SEQ ID NO:156) 

15 

This biosensor is similar to that shown in SEQ ID NO:2 except upon 
recognition and cleavage of the protease recognition site, the product is released and the 
signal accumulates specifically in the nucleus due to the presence of a nuclear 
localization sequence, RRKRQK (SEQ ID NO: 128) (Figure 29C)(Briggs et al., J. 

20 Biol. Chem. 273:22745, 1998) attached to the signal. A specific benefit of this 
construct is that the products are clearly separated firom the reactants. The reactants 
remain in the cytoplasm, while the product of the enzymatic reaction is restricted to the 
nuclear compartment. The response is measured by quantitating the effective 
cytoplasm-to-nuclear translocation of the signal, as described above. 

25 With the presence of both product and reactant targeting sequences in the parent 

biosensor, the reactant target sequence should be dominant prior to activation (e.g., 
protease cleavage) of the biosensor. One way to accomplish this is by masking the 
product targeting sequence in the parent biosensor until after protease cleavage. In one 
such example, the product target sequence is functional only when relatively near the 

30 end of a polypeptide chain (ie: after protease cleavage). Altematively, the biosensor 
may be designed so that its tertiary structure masks the fionction of the target sequence 
until after protease cleavage. Both of these approaches include comparing targeting 
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sequences with different relative strengths for targeting. Using the example of the 
nuclear localization sequence (NLS) and annexin II sequences, different strengths of 
NLS have been tried with clone selection based on cytoplasmic restriction of the parent 
biosensor. Upon activation, the product targeting sequence will naturally dominate the 
5 localization of its associated detectable sequence domain because it is then separated 
from the reactant targeting sequence. 

An added benefit of using this biosensor is that the product is targeted, and thus 
concentrated, into a smaller region of the cell. Thus, smaller amounts of product are 
detectable due to the increased concentration of the product. This concentration effect 
10 is relatively insensitive to the cellular concentration of the reactant. The signal-to-noise 
ratio (SNR) of such a measurement is improved over the more dispersed distribution of 
biosensor #1. 

Similar biosensors that incorporate either the caspase 6 (SEQ ED NO:66) 
(Figure 29B) or the caspase 8 protease recognition sequence (SEQ ID NO:74) (Figure 
15 29B) can be made using the methods described above, but using the following primer 
sets: 

Primers for Caspase 6, Product target sequence = NLS (CP6GFPNLS- 
CYTO) 

1 ) TCA TC A TCC GG A AG A AGG AAA CG A CAA AAG CGA TCG 
20 ACA AGA CTT GTT GAA ATT GAG AAG (SEQ ID NO:159) 

2) GAA GAA GGA TCC GGC ACT TGG GGG TGT AGA ATG AAC 
ACC CTC CAA GCT GAG CTT GCA CAG GAT TTC GTG GAC 
AGT AGA CAT AGT ACT GTT GTC AAT TTC (SEQ ID NO: 160) 

25 3) TCA TCA TCC GGA AGA AGG (SEQ ID NO:158) 
4) GAA GAA GGA TCC GGC ACT (SEQ ID NO:156) 

Primers for Caspase 8, Product target sequence == NLS (CP8GFPNLS-CYTO) 

1) TCA TCA TCC GGA AGA AGG AAA CGA CAA AAG CGA TCG 

30 TAT CAA AAA GGA ATA CCA GTT GAA ACA GAC AGC GAA GAG 
CAA CCT TAT (SEQ ID NO:161) 

2) GAA GAA GGA TCC GGC ACT TGG GGG TGT AGA ATG AAC ACC CTC 
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CAA GCT GAG CTT GCA GAG GAT TTC GTG GAC AGT AGA CAT ACT 
ACT ATA AGG TTG CTC (SEQ ID NO:162) 

3) TCA TCA TCC GGA AGA AGG (SEQ ffi NO:158) 

4) GAA GAA GGA TCC GGC ACT (SEQ ID NO:156) 

5 

The sequence of the resulting biosensors is shown in SEQ ID NO:13-14 
(Caspase 6) and SEQ ID NO: 15-16 (Caspase 8). Furthermore, multiple copies of the 
protease recognition sites can be inserted into the biosensor, yielding the biosensors 
shown in SEQ ID NO: 17-18 (Caspase 3) and SEQ ID NO:19-20 (Caspase 8). 

10 

2. Caspase 3 biosensor with a second signal domain 

An alternative embodiment employs a second signal domain operatively 
linked to the reactant target domain. In this example, full length MAP4 serves as the 
reactant target sequence. Upon recognition and cleavage, one product of the reaction, 

15 containing the reactant target sequence, remains bound to microtubules in the 
cytoplasm with its own unique signal, while the other product, containing the product 
target sequence, diffuses into the nucleus. This biosensor provides a means to measure 
two activities at once: caspase 3 activity using a translocation of GFP into the nucleus 
and microtubule cytoskeleton integrity in response to signaling cascades initiated 

20 during apoptosis, monitored by the MAP4 reactant target sequence. 

The basic premise for this biosensor is that the reactant is tethered to the 
microtubule cytoskeleton by virtue of the reactant target sequence comprising the full 
length microtubule associated protein MAP4 (SEQ ID NO:152) (Figure 29C) In this 
case, a DEVD (SEQ ID NO:60) (Figure 29B) recognition motif is located between the 
25 EYFP signal (SEQ ID NO:44) (Figure 29A) operatively linked to the reactant target 
sequence, as well as the EBFP signal (SEQ ID NO:48) (Figure 29A) operatively 
linked to the C-terminus of MAP4. The resulting biosensor is shown in SEQ ID 
NO:21-22. 

This biosensor can also include a product targeting domain, such as an NLS, 
30 operatively linked to the signal domain. 

With this biosensor, caspase-3 cleavage still releases the N-terminal GFP, which 
undergoes translocation to the nucleus (directed there by the NLS). Also, the MAP4 
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fragment, which is still intact following proteolysis by caspase-3, continues to report on 
the integrity of the microtubule cytoskeleton during the process of apoptosis via the 
second GFP molecule fused to the C-terminus of the biosensor. Therefore, this single 
chimeric protein allows simultaneous analysis of caspase-3 activity and the 

5 polymerization state of the microtubule cytoskeleton during apoptosis induced by a 
variety of agents. This biosensor is also useful for analysis of potential drug candidates 
that specifically target the microtubule cytoskeleton, since one can determine whether a 
particular drug induced apoptosis in addition to affecting microtubules. 

This biosensor potentially combines a unique signal for the reactant, 

10 fluorescence resonance energy transfer (FRET) from signal 2 to signal 1, and a unique 
signal localization for the product, nuclear accumulation of signal 1. The amount of 
product generated will also be indicated by the magnitude of the loss in FRET, but this 
will be a smaller SNR than the combination of FRET detection of reactant and spatial 
localization o f the product. 

15 FRET can occur when the emission spectrum of a donor overlaps significantly 

the absorption spectmm of an acceptor molecule, (dos Remedies, C.G., and P.D. 
Moens, 1995. Fluorescence resonance energy transfer spectroscopy is a reliable "ruler" 
for measuring structural changes in proteins. Dispelling the problem of the unknown 
orientation factor. J Struct BioL 115:175-85; Enmianouilidou, E., A.G. Teschemacher, 

20 A.E. Pouli, L.I, NichoUs, E.P. Seward, and G.A. Rutter. 1999. Imaging Ca(2+) 
concentration changes at the secretory vesicle surface with a recombinant targeted 
cameleon. CurrBioL 9:915-918.) The average physical distance between the donor and 
acceptor molecules should be between 1 nm and 10 nm with a preference of between 1 
nm and 6 nm. The intervening sequence length can vary considerably since the three 

25 dimensional structure of the peptide will determine the physical distance between donor 
and acceptor. This FRET signal can be measured as (1) the amount of quenching of the 
donor in the presence of the acceptor, (2) the amount of acceptor emission when 
exciting the donor, and/or (3) the ratio between the donor and acceptor emission. 
Altematively, fluorescent lifetimes of donor and acceptor could be measured. 

30 This case adds value to the above FRET biosensor by nature of the existence of 

the reactant targeting sequence. This sequence allows the placement of the biosensor 
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into specific compartments of the cell for a more direct readout of activity in those 
compartments such as the inner surface of the plasma membrane. 

The cytoplasmic second signal represents both original reactant plus one part of 
the product. The nuclear first signal represents another product of the reaction. Thus the 
enzymatic reaction has the added flexibility in that it can be represented as (1) nuclear 
intensity; (2) the nucleus /cytoplasm ratio; (3) the nucleus /cytoplasm FRET ratio; (4) 
cytoplasmic /cytoplasmic FRET ratio. 

The present FRET biosensor design differs from previous FRET-based 
biosensors (see WO 97/28261; W09837226) in that it signal measurement is based on 
spatial position rather than intensity. The products of the reaction are segregated from 
the reactants. It is this change in spatial position that is measured. The FRET-based 
biosensor is based on the separation, but not to another compartment, of a donor and 
acceptor pair. The intensity change is due to the physical separation of the donor and 
acceptor upon proteolytic cleavage. The disadvantages of FRET-based biosensors are 
(1) the SNR is rather low and difficult to measure, (2) the signal is not fixable. It must 
be recorded using living cells. Chemical fixation, for example with formaldehyde, 
cannot preserve both the parent and resultant signal; (3) the range of wavelengths are 
limiting and cover a larger range of the spectrum due to the presence of two 
fluorophores or a fluorophore and chromophore; (4) the constmction has greater 
limitations in that the donor and acceptor must be precisely arranged to ensure that the 
distance falls within 1 - 1 0 nm. 

Benefits of the positional biosensor includes: (1) ability to concentrate the 
signal in order to achieve a higher SNR. (2) ability to be used with either living or fixed 
cells; (3) only a single fluorescent signal is needed; (4) the arrangement of the domains 
of the biosensor is more flexible. The only limiting factor in the application of the 
positional biosensor is the need to define the spatial position of the signal which 
requires an imaging method with sufficient spatial resolution to resolve the difference 
between the reactant compartment and the product compartment. 

One of skill in the art will recognize that this approach can be adapted to report 
any desired combination of activities by simply making the appropriate substitutions 
for the protease recognition sequence and the reactant target sequence, including but 
not limited to those sequences shown in Figure 29A-C. 
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3. Caspase 8 biosensor with a nucleolar localization domain (CP8GFPNUC- 
CYTO) 

This approach (diagrammed in Figure 34) utilizes a biosensor for the detection 
5 of caspase-8 activity. In this biosensor, a nucleolar localization signal 
(RKJRIRTYLKSCRRMKJISGFEMSRPIPSHLT) (SEQ ID NO:130) (Figure 29C) 
(Ueki et aL, Biochem. Biophys. Res. Comm. 252:97-100, 1998) was used as the 
product target sequence, and made by PGR using the primers described below. The 
PGR product was digested with BspEl and Pvul and gel purified. The vector and the 
10 PGR product were ligated as described above. 

Primers for Caspase 8, Nucleolar localization signal (CP8GFPNUC-CYTO): 

1) TCA TCA TGGGGA AGA AAA GGT ATA GGT ACT TAG GTC AAG 
15 TCG TGG AGG CGG ATG AAA AGA (SEQ ID NO: 163) 

2) G AA G AA CGA TCG AGT AAG GTG GGA AGG AAT AGG TCG AGA 
CAT CTC AAA ACC ACT TCT TTT CAT (SEQ ID NO:164) 

3) TCA TCA TCG GGA AGA AAA (SEQ ID NO: 165) 

4) GAA GAA CGA TCG AGT AAG (SEQ ID NO: 166) 

20 The sequence of the resulting biosensor is shown in SEQ JD NO: 23-24. This 

biosensor includes the protease recognition site for caspase-8 (SEQ ED NO: 74) 
(Figure 29B). A similar biosensor utilizes the protease recognition site for caspase-3. 
(SEQ ID NO:25-2iS) 

These biosensors could be used with other biosensors that possess the same 

25 product signal color that are targeted to separate compartments, such as CP3GFPNLS- 
CYTO. The products of each biosensor reaction can be uniquely measured due to 
separation of the products based on the product targeting sequences. Both products 
fi-om CP8GFPNUC-CYT0 and CP3GFPNLS-CYTO are separable due to the different 
spatial positions, nucleus vs. nucleolus, even though the colors of the products are 

30 exactly the same. Assessing the non-nucleolar, nuclear region in order to avoid the 

spatial overlap of the two signals would perforai the measurement of CP3GFPNLS in 
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the presence of CP8GFPNUC. The loss of the nucleolar region from the nuclear signal 
is insignificant and does not significantly affect the SNR. The principle of assessing 
multiple parameters using the same product color significantly expands the number of 
parameters that can be assessed simultaneously in living cells. This concept can be 

5 extended to other non-overlapping product target compartments. 

Measurement of translocation to the nucleolar compartment is performed by (1) 
defining a mask corresponding to the nucleolus based on a nucleolus-specific marker, 
including but not limited to an antibody to nucleolin (Lischwe et aL, 1981. Exp. Cell 
Res. 136:101-109); (2) defining a mask for the reactant target compartment, and (3) 

10 determining the relative distribution of the signal between these two compartments. 
This relative distribution could be represented by the difference in the two intensities 
or, preferably, the ratio of the intensities between compartments. 

The combination of multiple positional biosensors can be complicated if the 
reactant compartments are overlapping. Although each signal could be measured by 

15 simply detemiining the amount of signal in each product target compartment, higher 
SNR will be possible if each reactant is uniquely identified and quantitated. This higher 
SNR can be maximized by adding a second signal domain of contrasting fluorescent 
property. This second signal may be produced by a signal domain operatively linked to 
the product targeting sequence, or by FRET (see above), or by a reactant targeting 

20 sequence uniquely identifying it within the reactant compartment based on color, 
spatial position, or fluorescent property including but not limited to polariMtion or 
lifetime. Alternatively, for large compartments, such as the cytoplasm, it is possible to 
place different, same colored biosensors in different parts of the same compartment. 

25 4. Protease biosensors with multiple copies of a second signal domain serving 
as a reactant target domain 

In another example, (CPSYFPNLS-SIZECFPn) increasing the size of the 
reactant is accomplished by using multiple inserts of a second signal sequence, for 
example, ECFP (SEQ ID NO:50) (Figure 29A) (Tsien, RY. 1998. Annu Rev 
30 Biochem. 67:509-44). Thus, the multiple copies of the second signal sequence serve as 
the reactant target domain by excluding the ability of the biosensor to diffuse into the 
nucleus. This type of biosensor provides the added benefit of additional signal being 
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available per biosensor molecule. Aggregation of multiple fluorescent probes also can 
result in unique signals being manifested, such as FRET, self quenching, eximer 
foraiation, etc. This could provide a unique signal to the reactants. 

5 5. Tetanus/botulinum biosensor with trans-membrane targeting 

domain 

In an alternative embodiment, a trans-membrane targeting sequence is used to 
tether the reactant to cytoplasmic vesicles, and an alternative protease recognition site 
is used. The tetanus/botulinum biosensor (SEQ ID NOS:27-28 (cellubrevin); 29-30 

10 (synaptobrevin) consists of an NLS (SEQ ID NO:128) (Figure 29C), Fret25 signal 
domain (SEQ ID NO:52) (Figure 29A), a tetanus or botulinimi zinc metalloprotease 
recognition site from cellubrevin (SEQ ID NO:106) (Figure 29B) (McMahon et al., 
Nature 364:346-349, 1993; Martin et al., J. Cell Biol., in press) or synaptobrevin (SEQ 
ID NO: 108) (Figure 29B) (GenBank Accession #U64520), and a trans-membrane 

15 sequence from cellubrevin (SEQ ID NO:146) (Figure 29C) or synaptobrevin (SEQ ID 
NO:144) (Figure 29C) at the 3 '-end which tethers the biosensor to cellular vesicles. 
The N-terminus of each protein is oriented towards the cytoplasm. In the intact 
biosensor, GFP is tethered to the vesicles. Upon cleavage by the tetanus or botulinum 
zinc metalloprotease, GFP will no longer be associated with the vesicle and is free to 

20 diffuse throughout the cytoplasm and the nucleus. 

b. 5-domain biosensors 

1. Caspase 3 biosensor with a nuclear localization domain and a 
second signal domain operatively linked to an annexin II domain 

25 The design of this biosensor is outlined in Figure 35, and the sequence 

is shown in SEQ ID NO:33-34. This biosensor differs from SEQ ID NO 11-12 by 
including a second detectable signal, ECFP (SEQ ID NO:50) (Figure 29 A) (signal 2) 
operatively linked to the reactant target sequence. 

30 2. Caspase 3 biosensor with a nuclear localization sequence and a 

second signal domain operatively linked to a MAP4 projection domain 
(CP3YFPNLS-CFPCYTO) 
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In this biosensor (SEQ ID NO:31-32), an NLS product targeting domain (SEQ 
ID NO:128) (Figure 29C) is present upstream of an EYFP signal domain (SEQ ID 
NO:44) (Figure 29A). A DEVD protease recognition domain (SEQ ID NO:60) 
(Figure 29B) is between after the EYFP signal domain and before the MAP4 
5 projection domain (SEQ ID NO:142) (Figure 29C). 

Example 11. Fluorescent Biosensor Toxin Characterization 

As used herein, "toxin" refers to any organism, macromolecule, or organic or 
inorganic molecule or ion that alters normal physiological processes found within a 
10 cell, or any organism, macromolecule, or organic or inorganic molecule or ion that 
alters the physiological response to modulators of known physiological processes. 
Thus, a toxin can mimic a normal cell stimulus, or can alter a response to a normal cell 
stimulus. 

Living cells are the targets of toxic agents that can comprise organisms, 

15 macromolecules, or organic or inorganic molecules. A cell-based approach to toxin 
detection, classification, and identification would exploit the sensitive and specific 
molecular detection and amplification system developed by cells to sense minute 
changes in their external milieu. By combining the evolved sensing capabiHty of cells 
with the luminescent reporter molecules and assays described herein, intracellular 

20 molecular and chemical events caused by toxic agents can be converted into detectable 
spatial and temporal luminescent signals. 

When a toxin interacts with a cell, whether it is at the cell surface or within a 
specific intracellular compartment, the toxin invariably undermines one or more 
components of the molecular pathways active within the cell. Because the cell is 

25 comprised of complex networks of interconnected molecular pathways, the effects of a 
toxin will likely bfe transmitted throughout many cellular pathways. Therefore, our 
strategy is to use molecular markers within key pathways likely to be affected by 
toxins, including but not limited to cell stress pathways, metabolic pathways, signaling 
pathways, and growth and division pathways. 

30 We have developed and characterized three classes of cell based luminescent 

reporter molecules to serve as reporters of toxic threat agents. These 3 classes are as 
follows: 
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(1) Detectors: general cell stress detection of a toxin; 

(2) Classifiers: perturbation of key molecular pathway(s) for detection and 
classification of a toxin; and 

(3) Identifiers: activity mediated detection and identification of a toxin or a 
5 group of toxins. 

Thus, in another aspect of the present invention, living cells are used as 
biosensors to interrogate the environment for the presence of toxic agents. In one 
embodiment of this aspect, an automated method for cell based toxin characterization is 
disclosed that comprises providing an array of locations containing cells to be treated 

ID with a test substance, wherein the cells possess at least a first luminescent reporter 
molecule comprising a detector and a second luminescent reporter molecule selected 
from the group consisting of a classifier or an identifier; contacting the cells with the 
test substance either before or after possession of the first and second luminescent 
reporter molecules by the cells; imaging or scanning multiple cells in each of the 

15 locations containing multiple cells to obtain luminescent signals from the detector; 
converting the luminescent signals from the detector into digital data to automatically 
measure changes in the localization, distribution, or activity of the detector on or in the 
cell, which indicates the presence of a toxin in the test substance; selectively imaging or 
scanning the locations containing cells that were contacted with test sample indicated to 

20 have a toxin in it to obtain luminescent signals from the second reporter molecule; 
converting the luminescent signals from the second luminescent reporter molecule into 
digital data to automatically measure changes in the localization, distribution, or 
activity of the classifier or identifier on or in the cell, wherein a change in the 
localization, distribution, stracture or activity of the classifier identifies a cell pathway 

25 that is perturbediby the toxin present in the test substance, or wherein a change in the 
localization, distribution, structure or activity of the identifier identifies the specific 
toxin that is present in the test substance. In a preferred embodiment, the cells possess 
at least a detector, a classifier, and an identifier. In a further preferred embodiment, the 
. digital data derived from the classifier is used to determine which identifier(s) to 

30 employ for identifying the specific toxin or group of toxins. 

As used herein, the phrase "the cells possess one or more luminescent reporter 
molecules" means that the luminescent reporter molecule may be expressed as a 
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luminescent reporter molecule by the cells, added to the cells as a luminescent reporter 
molecule, or luminescently labeled by contacting the cell with a luminescently labeled 
molecule that binds to the reporter molecule, such as a dye or antibody, that binds to the 
reporter molecule. The luminescent reporter molecule can be expressed or added to the 
5 cell either before or after treatment with the test substance. 

The luminescent reporters comprising detectors, classifiers, and identifiers may 
also be distributed separately into single or multiple cell types. For example, one cell 
type may contain a toxin detector, which, when activated by toxic activity, implies to 
the user that the same toxin sample should be screened with reporters of the classifier 

10 or identifier type in yet another population of cells identical to or different from the 
cells containing the toxin detector. 

The detector, classifier, and identifier can comprise the same reporter molecule, 
or they can comprise different reporters. 

Screening for changes in the localization, distribution, structure or activity of 

15 the detectors, classifiers, and/or identifiers can be carried out in either a high 
throughput or a high content mode. In general, a high-content assay can be converted 
to a high-throughput assay if the spatial information rendered by the high-content assay 
can be recoded in such a way as to no longer require optical spatial resolution on the 
cellular or subcellular levels. For example, a high-content assay for microtubule 

20 reorganization can be carried out by optically resolving luminescently labeled cellular 
microtubules and measuring their morphology (eg., bundled vs. non-bundled or 
normal). A high-throughput version of a niicrotubule reorganization assay would 
involve only a measurement of total microtubule polymer mass after cellular extraction 
with a detergent That is, destabilized microtubules, being more easily extracted, would 

25 . result in a lower total microtubule mass luminescence signal than unperturbed or drug- 
stabilized luminescently labeled microtubules in another treated cell population. The 
luminescent signal emanating fi"om a domain containing one or more cells will 
therefore be proportional to the total microtubule mass remaining in the cells after toxin 
treatment and detergent extraction. 

30 The methods for detecting, classifying, and identifying toxins can utilize the 

same screening methods described throughout the instant application, including but not 
limited to detecting changes in cytoplasm to nucleus translocation, nucleus or nucleolus 
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to cytoplasm translocation, receptor internalization, mitochondrial membrane potential, 
signal intensity, the spectral response of the reporter molecule, phosphorylation, 
intracellular free ion concentration, cell size, cell shape, cytoskeleton organization, 
metabolic processes, cell motility, cell substrate attachment, cell cycle events, and 
5 organellar structure and function. 

In all of these embodiments, the methods can be operated in both toxin-mimetic 
and toxin-inhibitory modes. 

Such techniques to assess the presence of toxins are useful for methods 
including, but not limited to, monitoring the presence of envirormniental toxins in test 

10 samples and for toxins utilized in chemical and biological weapons; and for detecting 
the presence and characteristics of toxins during environmental remediation, drug 
discovery, clinical applications, and during the normal development and manufacturing 
process by virtually any type of industry, including but not limited to agriculture, food 
processing, automobile, electronic, textile, medical device, and petroleiun industries. 

15 We have developed and characterized examples of luminescent cell-based 

reporters, distributed across the 3 sensor classes. The methods disclosed herein can be 
utilized in conjunction with computer databases, and data management, mining, 
retrieval, and display methods to extract meaningful patterns from the enormous data 
^ set generated by each individual reporter or a combinatorial of reporters in response to 

20 • toxic agents. Such databases and bioinformatics methods include, but are not limited 
to, those disclosed in U.S. Patent Application Nos. 09/437,976, filed November 10, 
1999; 60/145,770 filed July 27, 1999 and U.S. Patent Application Serial No. to be 
assigned, filed February 19, 2000. (98,068-C) 

Any cell type can be used to carry out this aspect of the invention, including 

25 prokaryotes such as bacteria and archaebacteria, and eukaryotes, such as single celled 
fungi (for example, yeast), molds (for example, Dictyostelium), and protozoa (for 
example, Euglena). Higher eukaryotes, including, but not limited to, avian, amphibian, 
insect, and mammaUan cells can also be used. 
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Examples of Biosensors 



Number | Name | Class | Cell Types | Response to model toxins 
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Positive Negative 


1 


Mitochondrial 
Potential 

[Donnan Equilibrium 
Dye] 


D 


• LLCPK (pig epithelia) 

• Rat primary hepatocytes 


Valinomycin Oligomycin 

(lOnM-lOOMM) (10 nM) 

FCCP 

(lOnM-IOOfiM) 


2 


Heat Shock Protein 
(Hsp 27, Hsp 70) 
GFP-chimera 


D 


• HeLa 

• 3T3 


Cadmium TNF-a 
(lOmM) (lOOng/^ml) 


3 


Tubulin- 
cytoskeieton 
[P-tubulin-GFP 
chimera] 


C 


• BHK 

• HeLa 

• LLCPK 


Paclitaxel Staurosporine 

(10nM-20^M) (I nM-1 piM) 

Curacin-A 

(5 nM-10^M) 

Nocadazole 

(7nM.|2(iM) 

Colchicine 

(5 nM-10^M) 

Vinblastine 

(5nM-10uM) 


4 


pp38 MAPK- stress 
signaling 

[antibody and GFP- 
chimera] 


C 


• 3T3 

• LLCPK 


Anisomycin TNF-a 

(lOOpM) (lOOng/ml) 

Cadmium 

<10mM) 


J 


NF-kB- stress 
signaling 

[antibody and GFP- 
chimeral 




• rieL*a 

• 3T3 

• BHK 

_ OVTO 1 ft 

• HepG2 

• LLCPK 


1 N r -a Amsomycm 
(lOOng/mI-0 J8pg/ml) (10 nM-IO nM) 
IL- 1 Cadmium 
(4ng^I-.095pg/mI) (1-10 ^M) 
Nisin Penitrem A 

(2>IOOO)ig^l) (10 MM) 

Streptolysin Valinomycin. 

(lOMg/ml) (1 mM) 

Anisomycin 

(tOO^M) 


6 


IkB 

[complement to NF- 
kB] 


c 


In many cell types 




7 


Tetanus Toxin 
[Protease activity- 
based sensorl 


I 


In many cell types 




8 


Anthrax LF 
[Protease activity- 
based sensor] 


I 


In many cell types 





Sensor Class: D=* Detector of toxins; C- Classifier of toxins; 1= Identifier of toxin or group of toxins 
The model toxins can generally be purchased from Sigma Chemical Company (St Louis, MO) 



Examples of Detectors: This class of sensors provides a first line signal that 
indicates the presence of a toxic agent. This class of sensors provides detection of 
general cellular stress that requires resolution limited only to the domain over which the 
measurement is being made» and they are amenable to high content screens as well. 
Thus, either high throughput or high content screening modes may be used, including 
but not limited to translocation of heat shock factors from the cytoplasm to the nucleus. 
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and changes in mitochondrial membrane potential, intracellular free ion concentration 
detection (for example, Ca^"^; H"^, general metabolic status, cell cycle timing events, 
and organellar structure and function, 

5 L Mitochondrial Potential 

A key to maintenance of cellular homeostasis is a constant ATP energy charge. 
The cycling of ATP and its metaboUtes ADP, AMP, inorganic phosphate, and solution- 
phase protons is continuously adjusted to meet the catabolic and anabolic needs of the 
celL Mitochondria are primarily responsible for maintaining a constant energy charge 

10 throughout the entire cell. To produce ATP from its constituents, mitochondria must 
maintain a constant membrane potential within the organelle itself. Therefore, 
measurement of this electrical potential with specific luminescent probes provides a 
sensitive and rapid readout of cellular stress. 

We have utilized a positively charged cyanine dye, JC-1 (Molecular Probes, 

15 Eugene, OR), which diffuses into the cell and readily partitions into the mitochondrial 
membrane, for measurement of mitochondrial potential. The photophysics of JC-1 are 
such that when the probe partitions into the mitochondrial membrane and it experiences 
an electrical potential >140 raV, the probe aggregates and its spectral response is 
shifted to the red. At^ membrane potential values^ <140 mV, JC-1 is primarily 

20 monomeric and its spectral response is shifted toward the blue. Therefore, the ratio of 
two emission wavelengths (645 mn and 530 nm) of JC-1 partitioned into mitochondria 
provides a sensitive and continuous measure of mitochondrial membrane potential. 

We have been making live cell measurements in a high throughput mode as the 
basis of a generalized indicator of toxic stress. The goal of our initial experiments was 

25 to determine the ratio of J-aggregates of JC-1 dye to its monomeric form both before 
and after toxic stress. 
Procedure 

1 . Cells were plated and cultured up to overnight. 

2. Cells were stained with JC-1 (10 \xg/ml) for 30 minutes at 37° C in a CO2 incubator, 
30 3 . Cells were then washed quickly with HBSS at 37°C (2 times, 1 50 jil/well), the 

toxins were added if required, and the entire plate was scanned tn a plate reader. 
The JC-1 monomer was measured optimally with a 485 nm excitation/530 nm 
emission wavelength filter set, and the aggregates were best measured with a 590 
nm excitation/645 nm emission wavelength set. 

35 
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Results 

The mitochondrial potential within several types of living cells, and the effects 
of toxins on the potential were measured using the fluorescence ratio Em 645 (590)/ 
5 Em 530 (485) (excitation wavelengths in parentheses). For example, we measured the 
effect of 10 \xM valinomycin on the mitochondrial potential within LLCPK cells (pig 
epithelia). Within seconds of treatment, the toxin induced a more rapid and higher 
magnitude decrease (an approximately 50% reduction) in mitochondrial potential than 
that found in untreated cells. Hepatocytes were also determined to be sensitive to 

10 valinomycin, and the changes in mitochondrial potential were nearly complete within 
seconds to minutes after addition of various concentrations of the toxin. 

These results are consistent with mitochondrial potential being a model 
intracellular detector of cell stress. Because these measurements require no spatial 
resolution within individual cells, mitochondrial potential measurements can be made 

15 rapidly on an entire cell array (e.g. high throughput). This means, for example, that 
complex arrays of many cell types can be probed simultaneously and continuously as a 
generalized toxic response. Such an indicator can provide a first line signal to indicate 
that a general toxic, stress is present in a sample. Further assays can then be conducted 
to more specifically identify the toxin using cells classifier or identifier type reporter 

20 molecules. 

2. Heai Shock Proteins 

Most mammalian cells will respond to a variety of environmental stimuli with 
the induction of a family of proteins called stress proteins. Anoxia, amino acid 

25 analogues, sulfhydryl-reacting reagents, transition metal ions, decouplers of oxidative 
phosphorylation, viral infections, ethanol, antibiotics, ionophores, non-steroidal 
antiinflammatory dnigs, theraial stress and metal chelators are all inducers of cell stress 
protein synthesis, function, or both. Upon induction, cell stress proteins play a role in 
folding and unfolding proteins, stabilizing proteins in abnormal configurations, and 

30 repairing DNA damage. 

There is evidence that at least, four heat shock proteins translocate from the 
cytoplasm to the nucleus upon stress activation of the cell. These proteins include the 
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heat shock proteins HSP27 and HSP70, the heat shock cognate HSC70, and the heat 
shock transcription factor HSFl. Therefore, measurement of cytoplasm to nuclear 
translocation of these proteins (and other stress proteins that translocate from the 
cytoplasm to the nucleus upon a cell stress) will provide a rapid readout of cellular 
5 stress. 

We have tested the response of an HSP27-GFP biosensor (SEQ ID 169-170) in 
two cell Hues (BHK and HeLa) using a library of heavy metal chemical compounds as 
biological toxin stimulants to stress the cells. Briefly, cells expressing the HSP27-GFP 
biosensor are plated into 96-well microplates, and allowed to attach. The cells are then 
10 treated with a panel of cell stress-inducing compounds. Exclusively cytoplasmic 
localization of the fusion protein was found in unstimulated cells. 

Other similar heat shock protein biosensors (HSP-70, HSC70, and HSFl fused 
to GFP) can be used as detectors, and are shown in SEQ ED NO: 171-176. 

15 ' 

Examples of Classifiers: 

This class of sensors detects the presence of, and further classifies toxins by 
identifying the cellular pathway(s) perturbed by the toxin. As such, this suite of sensors 
can detect and/or classify toxins into broad categories, including but not limited to 

20 "toxins affecting signal transduction/' "toxins affecting the cytoskeleton," and "toxins 
affecting protein synthesis". Either high throughput or high content screening modes 
may be used. Classifiers can comprise compoxmds including but not limited to tubulin, 
microtubule-associated proteins, actin, actin-binding proteins including but not limited 
to vinculin, a-actinin, actin depolymerizing factor/cofilin, profilin, and myosin; NF-kB, 

25 IicB, GTP-binding proteins including but not limited to rac, rho, and cdc42, and stress- 
activated protein kinases including but not limited to p38 mitogen-activated protein 
kinase. 

7. Tubulin-cvtoskeleton 
30 The cell cytoskeleton plays a major role in cellular functions arid processes, 

such as endo- and exocytosis, vesicle transport, and mitosis. Cytoskeleton-affecting 
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toxins, of proteinaceous and non-proteinaceous form, such as C2 toxin, and several 
classes of enterotoxins, act either directly on the cytoskeleton, or indirectly via 
regulatory components controlling the organization of the cytoskeleton. Therefore, 
measurement of structural changes in the cytoskeleton can provide classification of the 
toxin into a class of cytoskeleton-affecting toxins. This assay can be conducted in a 
high content mode, as described previously, or in a high throughput mode. For high 
throughput as discussed previously. j 

Such measurements will be valuable for identification of toxins including, but 
not limited to anti-microtubule agents, agents that generally affect cell cycle 
progression and cell proliferation, intracellular signal transduction, and metabolic 
processes. 

For microtubule disruption assays, LLCPK cells stably transfected with a 
tubulin-GFP biosensor plasmid were plated on 96 well cell culture dishes at 50-60% 
confluence and cultured overnight at 37 ^^C, 5% CO2. A series of concentrations (10- 
500 nM) of 5 compounds (paclitaxel, curacin A, nocodazole, vinblastine, and 
colchicine) in normal culture media were freshly prepared from stock, and were added 
to cell culture dishes to replace the old culture media. The cells were then observed 
with the cell screening system described above, at a 12 hour time point. 

Our data indicate that the tubulin chimera localizes to and assembles into 

microtubules throughout the cell. The microtubule arrays in cells expressing the 

chimera respond as follows to a variety of anti-microtubule compounds: 

Drug Response 

Vinblastine Destabilization 

Nocodazole Destabilization 

Paclitaxel Stabilization 

Colchicine Destabilization 

Cxuracin A Destabilization 

Similar data were obtained using cells expressing the tubulin biosensor that 
were patterned onto cell arrays (such as those described in U.S. Patent Application 
Serial No. 08/865,341 filed May 29, 1997, incorporated by reference herein in its 
entirety) and dosed as above. 
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NF-kB is cytoplasmic at basal levels of stimulation, but upon insult translocates 
to the nucleus where it binds specific DNA response elements and activates 
transcription of a niunber of genes. Translocation occurs when IkB is degraded by the 
5 proteosome in response to specific phosphorylation and ubiquitination events. IkB 
normally retains NF-kB in the cytoplasm via direct interaction with the protein, and 
masking of the NLS sequence of NF-icB. Therefore, although not the initial or defining 
event of the whole signal cascade, NF-kB translocation to the nucleus can serve as an 
indicator of cell stress. 

10 We have generated an NF-kB-GFP chimera for analysis in live cells. This was 

accomplished using standard polymerase chain reaction techniques using a 
characterized NF-kB p65 cDNA purchased from Invitrogen (Carlsbad, CA) fused to an 
EYFP PCR amplimer that was obtained from Clontech Laboratories (Palo Alto, CA). 
The resulting chimera is shown in SEQ ID NO:177-178. The two PCR products were 

15 ligated into an eukaryotic expression vector designed to produce the chimeric protein at 
high levels vising the ubiquitous CMV promoter. 

NF-kB immunolocalization 

20 For further studies, we characterized endogenous NF-kB activation by 

immunolocalization in toxin treated cells. The NF-kB antibodies, used in this study 
were purchased from Santa Cruz Biotechnology, Inc. (Santa Cruz, CA), and secondary 
antibodies are from Molecular Probes (Eugene, OR), 

For the 3T3 and SNB19 cell types, we determined the effective concentrations 

25 that yield response levels of 50% of the maximum (EC50), expressed in units of mass 
per volume (ng/ml) and units of molarity. Based on molecular weights of 17 kD for 
both TNFa and IL-la, the EC50 levels for these two compounds with 3T3 and SNB19 
cell types are given in units of molarity in Table 1. Our results demonstrated 
reproducibiUty of the relative responses from zero to maximum dose, but from sample 

30 to sample there have been occasional shifts in the baseline intensities of the response at 
zero concentration. 
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For these experiments, either 10 or 100 TNFa-treated 3T3 or SNB19 cells/well 
were tested. On the basis of the standard deviations measured for these samples, and 
by taking t- values for the student's t-test, we h£^ve estimated the minimum detectable 
doses for each case of cell type, compound, nmnber of cells per well, and for different 
5 choices of how many wells are sampled per condition. The latter factor determines the 
number of degrees of freedom that are provided in the sample of data. Increasing the 
number of wells from 4 to 16, and increasing the number of cells per well from 10 to 
100, improves the minimum detectable doses considerably. For 3T3 cells, which show 
lower minimum detectable doses than the SNB19 cells, and for the case of 1% false 
10 negative and 1% false positive rates, we estimate that 100 cells per well and a sampling 
of 12 or 16 wells are sufficient to detect a dose approximately equal to the EC50 value 
of 0.15 ng/ml. If the false positive rate is relaxed to 20%, a concentration of 
approximately half that value can be detected (0.83 ng/ml). One hundred cells can 
conveniently be sampled over a cell culture surface area of less than 1 nun^. 

15 

Table 1 . EC50 levels for TNFa and IL-la (based on molecular weights of 17 kD for 
both) 



Compound 


Cell Type 


EC50 (10 -'^moles/liter) 








TNFa 


3T3 


8.8 




SNB19 


5.9 








IL-la 


3T3 


0.24 




SNB19 


59 



5. Phospho-p38 Mitogen Activated Protein Kinase CvpSSMAPK) 

MAPKs play a role in not only cell growth and division, but as mediators of 
cellular stress responses. One MAPK, p38, is activated by chemical stress inducers 
such as hyper-osmolar sorbitol, hydrogen peroxide, arsenite, cadmiiun ions, 
25 anisoraycin, sodium salicylate, and LPS. Activation of p38 is also accompanied by its 
translocation into the nucleus from the cytoplasm. 
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MAPK p38 lies in a pathway that is a cascade of kinases. Thus, p38 is a 
substrate of one or more kinases, and it acts to phosphorylate one or more substrates in 
time and space within the living cell. 

The assay we present here measures, as one of its parameters, p38 activation 
5 using immunolocalization of the phosphorylated form of p38 in toxin-treated cells. The 
assay was developed to be flexible enough to include the simultaneous measurement of 
other parameters within the same individual cells. Because the signal transduction 
pathway mediated by the transcription factor NF-kB is also known to be involved in the 
cell stress response, we included the activation of NF-kB as a second parameter in the 
10 same assay. 

Our experiments demonstrate an immimofluorescence approach can be used to 
measure p38 MAPK activation either alone or in combination with NF-kB activation in 
the same cells. Multiple cell types, model toxins, and antibodies were tested, and 
significant stimulation of both pathways was measured in a high-content mode. The 

15 phospho-p38 antibodies used in this study were purchased from Sigma Chemical 
Company (St. Louis, MO). We report that at least two cell stress signaling pathways 
can not only be meiasured simultaneously, but are differentially responsive to classes of 
model toxins. Figure 36 showis the differential response of flie p38 MAPK and NF-kB 
pathways across three model toxins and two different cell types. Note that when added 

20 alone, three of the model toxins (ILla, TNFa and Anisomycin) can be differentiated 
by the two assays as activators of specific pathways. 

IkB chimera 

IkB degradation is the key event leading to nuclear translocation of NF-kB and 
25 activation of the NFkB-mediated stress response. We have chosen this sensor to 
complement the NF-kB sensor as a classifier in a high-throughput mode: the 
measurement of loss of signal due to degradation of the IkB-GFP fusion protein 
requires no spatial resolution within individual cells, and as such we envision IkB 
degradation measurements being made rapidly on an entire cell substrate. 
30 This biosensor is based on fusion of the first 60 amino acids of HcB to the 

Fred25 variant of GFP. SEQ ID 179-180 This region of IkB contains all the regulatory 
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sequences, including phosphorylation sites and ubiquitination sites, necessary to confer 
proteosome degradation upon the biosensor. Knowing this, stimulation of any pathway 
that would typically lead to NFkB translocation results in degradation of this biosensor. 
Monitoring the fluorescence intensity of cells expressing IkB-GFP identifies the 
degradation process. 

Examples of Identifiers: 

In our toxin identification strategy, the first two levels of characterization ensure 
a rapid readout of toxin class without sacrificing the abiUty to detect many new mutant 
toxins or dissect several complex mixtures of known toxins. The third level of 
biosensors are identifiers, which can identify a specific toxin or group of toxins. In one 
embodiment, an identifier comprises a protease biosensor that responds to the activity 
of a specific toxin. Other identifiers are produced with reporters/biosensors specific to 
their activities. These include, but are not limited to post-translational modifications 
such as phosphorylation or ADP-ribosylation, translocation between cellular organelles 
or compartments, effects on specific organelles or cellular components (for example, 
membrane permeabilization, cytoskeleton rearrangement, etp.) 

ADP-ribosvlating toxins — These toxins include Pseudomonas toxin A, diptheria 
toxin, botulinum toxin, pertussis toxin, and cholera toxin. For example, C. botulinum 
C2 toxin induces the ADP-ribosylation of Argl77 in the cytoskeletal protein actin, thus 
altering its assembly properties. Besides the constmction of a classifier assay to 
measure actin-cytoskeleton regulation, an identifier assay can be constructed to detect 
the specific actin ADP-ribosylation. Because the ADP-ribosylation induces a 
conformational change that no longer permits the modified actin to polymerize, this 
conformational change can be detected intracellularly in several possible ways using 
luminescent reagents. For example, actin can be luminescently labeled using a 
fluorescent reagent with an appropriate excited state lifetime that allows for the 
measurement of the rotational diffusion of the intracellular actin using steady state 
fluorescence anisotropy. That is, toxin-modified actin will no longer be able to 
assemble into rigid filaments and will therefore produce only luminescent signals with 
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relatively low anisotropy, which can be readily measured with an imaging system. In 
another embodiment, actin can be labeled with a polarity-sensitive fluorescent reagent 
that reports changes in actin-conformation through spectral shifts of the attached 
reagent. That is, toxin-treatment will induce a conformational change in intracellular 
actin such that a ratio of two fluorescence wavelengths will provide a measure of actin 
ADP-ribosylation. 

Cvtotoxic phospholinases - Several gram-positive bacterial species produce 
cytotoxic phospholipases. For example, Clostridium perfringens produces a 
phospholipase C specific for the cleavage of phosphoinositides. These 
phosphoinositides (e.g., inositol 1,4,5-trisphosphate) induce the release of calcium ions 
from intracellular organelles. An assay that can be conducted as either high-content or 
high-throughput can be constmcted to measure the release of calcium ions using 
fluorescent reagents that have altered spectral properties when complexed with the 
metal ion. Therefore, a direct consequence of the action of a phospholipase C based 
toxin can be measured as a change in cellular calcimn ion concentration. 

Exfoliative toxins - These toxins are produced by several Staphylococcal 
species and can consist of several serotypes. A specific identifier for these toxins can be 
constructed by measuring the morphological changes in their target organelle, the 
desmosome, which occur at the junctions between cells. The exfoliative toxins are 
known to change the morphology of the desmosomes into two smaller components 
called hemidesmosomes. hi the high-content assay for exfoliative toxins, epithelial cells 
whose desmosomes are luminescently labeled are subjected to image analysis. An 
method that detects the morphological change between desmosomes and 
hemidesmosomes is used to quantify the activity of the toxins on the cells. 

Most of these identifiers can be used in high throughput assays requiring no 
spatial resolution, as well as in high content assays. 

Several biological threat agents act as specific proteases, and thus we have 
focused on the development of fluorescent protein biosensors that report the proteolytic 
cleavage of specific amino acid sequences found within the target proteins. 

A number of such protease biosensors (including FRET biosensors) are 
disclosed above, such as the caspase biosensors, anthrax, tetanus, Botulinum, and the 
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zinc metalloproteases. FRET is a powerful technique in that small changes in protein 
conformation, many of which are associated with toxin activity, can not only be 
measured with high precision in time and space within hving cells, but can be measured 
in a high-throughput mode, as discussed above. 

5 As described above, one of skill in the art will recognize that the protease 

biosensors of this aspect of the invention can be adapted to report the activity of any 
protease, by a substitution of the appropriate protease recognition site in any of the 
constructs (see Figure 29B). As disclosed above, these biosensors can be used in high- 
content or high throughput screens to detect in vivo activation of enzymatic activity by 
10 toxins, and to identify specific activity based on cleavage of a known recognition motif. 
These biosensors can be used in both live cell and fixed end-point assays, and can be 
combined with additional measurements to provide a multi-parameter assay. 

Anthrax LF 

15 Anthrax is a well-known agent of biological warfare and is an excellent target 

for development of a biosensor in the identifier class. Lethal factor (LF) is one of the 
protein components that confer toxicity to anthrax, and recently two of its targets within 
cells were identified. LF is a metalloprotease that specifically cleaves Mekl and Mek2 
proteins, kinases that are part of the MAP-kinase signaling pathway. Construction of 

20 lethal factor protease biosensors are described above. (SEQ ID NO:7-8; 9-10) Green 
fluorescent protein (GFP) is fused in-frame at the amino terminus of either Mekl or 
Mek2 (or both), resulting in a chimeric protein that is retained in the cytoplasm due to 
the presence of a nuclear export sequence (NES) present in both of the target 
molecules. Upon cleavage by active lethal factor, GFP is released from the chimera and 

25 is free to diffuse into the nucleus. Therefore, measuring the accumulation of GFP in the 
nucleus provides a direct measure of LF activity on its natural target, the living ceU. 

While a preferred form of the invention has been shown in the drawings and 
described, since variations in the preferred form will be apparent to those skilled in the 
art, the invention should not be construed as Umited to the specific form shown and 
30 described, but instead is as set forth in the claims. 
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CLAIMS 

We claim: 

1 . An automated method for cell based toxin characterization comprising 

-providing an array of locations containing cells to be treated with a test 
substance, wherein the cells possess at least a first luminescent reporter molecule 
comprising a detector and a second luminescent reporter molecule selected from the 
group consisting of a classifier or an identifier; 

-contacting the cells with the test substance either before or after possession of 
the first and second luminescent reporter molecules by the cells; wherein the 
localization, distribution, structure, or activity of the first and second luminescent 
reporter molecule is modified when the cell is contacted with the toxin, 

-imaging or scanning multiple cells in each of the locations containing multiple 
cells to obtain luminescent signals firom the detector; 

-converting the luminescent signals from the detector into digital data; 

-utilizing the digital data from the detector to automatically measure the 
localization, distribution, or activity of the detector on or in the cell, wherein a change 
in the localization, distribution, structure or activity of the detector indicates the 
presence of a toxin in the test substance; 

-selectively, imaging or scanning the locations containing cells that ^were 
contacted with test sample indicated to have a toxin in it to obtain luminescent signals 
from the second reporter molecule; 

-converting the luminescent signals from the second luminescent reporter 
molecule into digital data; 

-utilizing the digital data from the second luminescent reporter molecule to 
automatically measure the localization, distribution, or activity of the classifier or 
identifier on or in the cell, wherein a change in the localization, distribution, stmcture 
or activity of the classifier identifies a cell pathway that is perturbed by the toxin 
present in the test substance, or wherein a change in the localization, distribution, 
structure or activity of the identifier identifies the specific toxin or group of toxins that 
are present in the test substance. 
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2. The method of claim 1 wherein the second luminescent reporter molecule is a 
classifier, and the digital data derived fi-om the classifier is used to select an appropriate 
identifier for identification of the specific toxin or group of toxins. 

5 3. An automated method for cell based toxin characterization comprising 

-providing an array of locations containing cells to be treated with a test 
substance, wherein the cells possess at least a first luminescent reporter molecule 
comprising a detector, a second luminescent reporter molecule comprising a classifier, 
and a third luminescent reporter molecule comprising an identifier; 
10 -contacting the cells with the test substance either before or after possession of 

the first second, and third luminescent reporter molecules by the cells; wherein the 
localization, distribution, structure, or activity of the first, second, and third luminescent 
reporter molecules is modified when the cell is contacted with the toxin, 

-imaging or scanning multiple cells in each of the locations containing multiple 
15 cells to obtain luminescent signals firom the detector; 

-converting the luminescent signals fi-om the detector into digital data; 
-utilizing the digital data fi*6m the detector to automatically measure the 
localization, distribution, or activity of the detector on or in the cell, wherein a change 
in the localization, ^distribution, structure or activity of the detector indicates the 
20 presence of a toxin in the test substance; 

-selectively imaging or scanning the locations containing cells that were 
contacted with test. sample indicated to have a toxin in it to obtain luminescent signals 
from the classifier; 

-converting the luminescent signals from the classifier into digital data; 
25 -utilizing the digital data from the classifier to automatically measure the 

localization, distribution, or activity of the classifier on or in the cell, wherein a change 
in the localization, distribution, structure or activity of the classifier identifies a cell 
pathway that is perturbed by the toxin present in the test substance; 

—selectively imaging or scaiming the locations containing cells that were 
30 contacted with test sample indicated to have a toxin in it to obtain luminescent signals 
from the identifier; 

-converting the luminescent signals from the identifier into digital data; and 
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-Utilizing the digital data from the identifier to automatically measure the 
localization, distribution, or activity of the identifier on or in the cell, wherein a change 
in the localization, distribution, structure or activity of the identifier identifies the 
specific toxin or group of toxins that is present in the test substance. 

5 

4. The method of claim 3 wherein the digital data derived from the classifier is 
used to select an appropriate identifier for identification of the specific toxin or group 
of toxins. 

10 5. The method of any one of claim 1-4 wherein the detector comprises a molecule 
selected from the group consisting of heat shock proteins and compoimds that respond 
to changes in mitochondrial membrane potential, intracellular free ion concentration, 
cytoskeletal organization, general metabolic status, cell cycle timing events, and 
organellar structure and function. 

15 " 

6. The method of any one of claim 1-5 wherein the classifier comprises a molecule 
selected from the group consisting of tubulin, microtubule-associated proteins, actin, 
actin-binding proteins, NF-kB, IkB, and stress-activated kinases. 

20 7. The method of any one of claim 1-6 wherein the cell pathway is selected from 
the group consisting of cell stress pathways, cell metabolic pathways, cell signaling 
pathways, cell growth pathways, and cell division pathways. 

8. The method of claim 1, wherein the second luminescent reporter molecule is an 
25 identifier, and the identifier identifies a toxin or group of toxins selected from the group 

consisting of proteases, ADP-ribosylating toxins, cytotoxic phospholipases, and 
exfoliative toxins. 

9. The method of any one of claim 3-7, wherein the identifier identifies a toxin or 
30 group of toxins selected from the group consisting of proteases, ADP-ribosylating 

toxins, cytotoxic phospholipases, and exfoliative toxins. 
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10. The method of any of claims 1-9 wherein the change in the localization, 
distribution, structure or activity of the first, second, or third luminescent reporter 
molecules is selected from the group consisting of cytoplasm to nucleus translocation, 
nucleus or nucleolus to cytoplasm translocation, receptor internalization, mitochondrial 
5 membrane potential, loss of signal, the spectral response of the reporter molecule, 
phosphorylation, intracellular free ion concentration, cell size, cell shape, cytoskeleton 
organization, metabolic processes, cell motility, cell substrate attachment, cell cycle 
events, and organellar structure and function. 

10 11, The method of any one of claims 1-10, wherein the imaging or scanning 
multiple cells in each of the locations containing multiple cells to obtain luminescent 
signals from the detector is carried out in a high throughput mode, 

12. The method of any one of claims 1-10, wherein the imaging or scanning 
15 multiple cells in each of the locations containing multiple cells to obtain luminescent 

signals firom the detector is carried out in a high content mode. 

13. The method of claim 1-10 wherein the selective imaging or scanning of the 
locations containing cells that were contacted with test sample indicated to haVe a toxin 

20 in it to obtain luminescent signals from the second or third reporter molecule is carried 
out in a high throughput mode. 

14. The method of claim 1-10 wherein the selective imaging or scanning of the 
locations containing cells that were contacted with test sample indicated to have a toxin 

25 in it to obtain luminescent signals fi-om the second or third reporter molecule is carried 
out in a high content mode. 

15. The method of any one of claims 1-14 fiirther comprismg providing a digital 
storage media for data storage and archiving. 

.30 

16. The method of claim 15 further comprising a means for automated control, 
acquisition, processing and display of results. 
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17. A computer readable storage medium comprising a program containing a set of 
instructions for causing a cell screening system to execute the method of any one of 
claims 1-16, wherein the cell screening system comprises an optical system with a stage 
5 adapted for holding a plate containing cells, a means for moving the stage or the optical 
system, a digital camera, a means for directing light emitted from the cells to the digital 
camera, and a computer means for receiving and processing the digital data from the 
digital camera, 

10 18. A kit for cell based toxin detection comprising: 

(a) at least one reporter molecule, wherein the localization, distribution, 
structure, or activity of the reporter molecule is modified when the cell is contacted 
with a toxin; 

(b) instmctions for using the reporter molecule to carry out the method of 
-15 any one of claims 1-16 to detect toxins in a test substance. 

19. The kit of claim 18 fiirther comprising the computer readable storage medium 
of claim 17. 

20 20. An automated method for cell based toxin characterization comprising 

-providing a first array of locations containing cells to be treated with a test 
substance, wherein the cells possess a least a first luminescent reporter moleciale 
comprising a reporter molecule selected from the group consisting of detectors and 
classifiers; 

25 -contacting the cells with the test substance either before or after possession of 

the first luminescent reporter molecule by the cells; wherein the localization, 
distribution, structure, or activity of the first luminescent reporter molecule is modified 
when the cell is contacted with the toxin, 

-imaging or scanning multiple cells in each of the locations containing multiple 
30 cells to obtain luminescent signals from the detector; 

-converting the luminescent signals from the detector into digital data; 
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-Utilizing the digital data from the detector to automatically measure the 
localization, distribution, or activity of the detector on or in the cell, wherein a change 
in the localization, distribution, structure or activity of the detector indicates the 
presence of a toxin in the test substance, 

-providing a second array of locations containing cells to be treated with the test 
substance, wherein the cells possess a least a second luminescent reporter molecule 
comprising a reporter molecule selected from the group consisting of classifiers and 
identifiers, and wherein the second array of locations containing cells can comprise 
either the same or a different cell type as the first array of locations containing cells; 

-contacting the second array of locations containing cells with the test substance 
either before or after possession of the second luminescent reporter molecule by the 
cells; wherein the localization, distribution, stmcture, or activity of the second 
Imninescent reporter molecule is modified when the cell is contacted with the toxin; 

-utilizing the digital data from the second luminescent reporter molecule to 
automatically measure the localization, distribution, or activity of the classifier or 
identifier on or in the cell, wherein a change in the localization, distribution, structure 
or activity of the classifier identifies a cell pathway that is perturbed by the toxin 
present in the test substance, or wherein a change in the localization, distribution, 
structure or activity of the identifier identifies the specific toxin or group of toxins that 
are present in the test substance. 
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I. SIGNAL SEQUENCES 



EPITOPE 


SEQUENCE 


SEQ ID NO: 


REFERENCE 


FLAG epitope 


5 ' GACTACAAAGACGACG 
AA Seq: ACQACAAA 


35 
36 


Kasir, etat.. 1999. J Biol 
Chem. 274:24873-60. 


HA epitope 


5 ' TACCCATACGACGTACCAGACTACGCA 
AA Seq: yPVDVPDYA 


37 
38 


Smith, et aL, 1999. J Bioi 
Chem. 274:19894-900.' 


KT3 epitope 


5 ' CCACCAGAACCAGAAACA 
AA seq: PPEPET 


39 
40 


MacArthur and Walter. 
19B4. J \/)rDl. 52:483-91. 


Myc epitope 


5 ' GCAGAAGAACAAAAATTAATAAGCGAAGA 
AGACTTA 

AA Seq: AEEQKLXSEBDL 


41 
42 


Gosney,etaU 1990. 
Anticancer Res. 10:623-8. 











EYFP: SEQ ro NO: 43 (Nucleic acid); SEQ ID NO:44 (Amino acid) 

MVS K GEEL PTGV VPIL VEL D 
ATGGTGAGCAAG GGCGAGGAQCTG TTCACCGGGGTG GTGCCCATCCTG GTCGAGCTOGAC 

GDVN GH KF SVSG EGEG DATY 
GGCGACGTAAAC GGCCACAAGTTC AGCGTGTCCGGC GAGGGCGAGGGC GATGCCACCTAC 

GKliT IiK PI CTTG KL 'PV PWPT 
GGCAAGCTGACC CTGAAGTTCATC TGCACCACCGGC JVAGCTGCCCGTG . CCCTGGCCCACC 

L V -n . T FGYG .LQCP ARYP DHMK 
CTCGTGACCACC TTCGGCTACGGC CTGCAGTGCTTC GCCCGCTACCCC GACCACATGAAG 

QHDP F KS A MPEG YVQE RTIF 
CAGCACGACTTC TTCAAQTCCGCC ATGCCCGAAGGC TACGTCCAGGAG CGCACCATCTTC 

FKDD GNYK T RAE VKFE GDTL 
TTCAAGGACGAC GGCAACTACAAG ACCCGCGCCGAG GTGAAGTTCGAG GGCGACACCCTG 

VNRI EL KG ID PK E DG N ILGH. 
GTGAACCGCATC GAGCTGAAGGGC ATCGACTTCAAG GAGGACGGCAAC ATCCTGGGQCAC 

KLEY NYN S HNVY I. MAD KQKN 
AAGCTGQAGTAC AACTACAACAGC CACAACGTCTAT ATCATGGCCGAC AAGCAGAAGAAC 

GIKV NFKI RHNI e'dgS V QLA 
GGCATCAAGGTG AACTTCAAGATC CGCCACAACATC GAGGACGGCAGC GTGCAGCTCGCC 

DHYQ QNTP IG D G PVL L PDNH 
GACCACTACCAG CAGAACACCCCC ATCGGCGACGGC CCCGTGCTGCTG CCCGACAACCAC 
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YLSY QSAL SKDP NEKR DHM V 
TACCTGAGCTAC CAGTCCGCCCTG AGCAAAGACCCC AAGGAGAAGCGC GATCACATGGTC 

LLEF VTAA G ITL GMDE LYK 
CTGCTGGAGTTC GTGACCGCCGCC GGGATCACTCTC GGCATGGACGAG CTGTACAAG 



EGFP: SEQIDNO:45 (Nucleic acid); SEQ ID NO:46 (Amino acid) 

MVSK GEEL PTGV VPI L VELD 

atggtgagcaag ggcgaggagctg ttcaccggggtg gtgcccatcctg gtcgagctggac 

GDVN GHK F SVSG EGEG DATY 
GGCGACGTAAAC GGCCACAAGTTC AGCGTGTCCGGC gagggcgagggc gatgccacctac 

gklt lkfi ct tg klpv pwpt 
ggcaagctgacc ctgaagttcatc tgcaccaccggc aagctgcccgtg ccctggcccacc 

liVTT LTY.G VQCF SRYP DHMK 

ctcgtgaccacc ctgacctacggc gtgcagtgcttc agccgctacccc gaccacatgaag 

qhdf fksa mp.eg yvqe rtip 
cagcacgacttc ttcaagtccgcc atgcccgaaggc tacgtccaggag cgcaccatcttc 

fkdd gnyk tra e v.kfe gdt l 
ttcaaggacgac ggcaactacaag acccgcgccgag gtgaagttcgag ggcqacaccctg 

vnri elkg idfk edgn ilgh 
gtgaaccgcatc gagctgaagggc atcgacttcaag gaggacggcaac atcctggggcac 

KLEY NYNS HN.VY I MAD KQKN 
AAGCTGGAGTAC AACTACAACAGC CACAACGTCTAT ATCATGGCCGAC AAGCAGAAGAAC 

GIKV NFK'I RH NI ED GS VQIiA 
GGCATCAAGGTG AACTTCAAGATC CGCCACAACATC GAGGACGGCAGC GTGCAGCTCGCC 

DHYQ QNTP IGDG PVLI, PDNH 
QACCACTACCAG CAGAACACCCCC ATCGGCGACGGC CCCGTGCTGCTG CCCGACAACCAC 

YLST QSAL SKDP NEKR DHMV 
TACCTGAGCACC CAGTCCGCCCTG AGCAAAGACCCC AAGGAGAAGCGC GATCACATGGTC 

LLEP VTAA GITIi GMDE LYK 
CTGCTGGAGTTC GTGACCGCCGCC GGGATCACTCTC GGCATGGACGAG CTGTACAAG 



EBFP: SEQIDNO:47 (Nucleic acid); SEQ ID lSrO:48 (Amino acid) 

MVSK G.EEIj FTGV VPIL VELD 
ATGGTGAGCAAG GGCGAGGAGCTG TTCACCGGGGTG GTGCCCATCCTG GTCGAGCTGGAC 
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GDVN GHKP SVSG EG EG DATY 
GGCGACGTAAAC GGCCACAAGTTC AGCGTGTCCGGC GAGGGCGAGGGC GATGCCACCTAC 

GKLT LKPI CTTG KLP V PWPT 
GGCAAGCTGACC CTGAAGTTCATC TGCACCACCGQC AAGCTGCCCGTG CCCTGGCCCACC 

LVTT LT HG VQCP SRYP D HMK 
CTCGTGACCACC CTGACCCACGGC GTGCAGT3CTTC AGCCGC7ACCCC GACCACATOAAG 

Q H D F F K S A MPEG .Y V Q E R T P 
CAGCACGACTTC TTCAAGTCCGCC ATGCCCGAAGGC TACGTCCAGGAG CGCACCATCTTC 

F K DD GNYK T RA E VKFE GDTL 
TTCAAGGACGAC GGCAACTACAAG ACCCGCGCCGAG GTGAAGTTCGAG GGCGACACCCTG 

VNRI ELKG I D PK E DGN ILGH 
GTGAACCGCATC GAGCTGAAGGGC ATCGACTTCAAG GAGGACGGCAAC ATCCTGGGGCAC 

K LEY .N FNS HN VY IMAD KQ KN 
• AAGCTGGAGTAC AACTTCAACAGC CACAACGTCTAT ATCATGGCCGAC AAGCAGAAGAAC 

GIKV NFK I RHN I EDGS VQLA 
GGCATCAAGGTG AACTTGAAGATC CGCCACAACATC GAGGACGGCAGC GTGCAGCTCGCC 

DHYQ QNTP IGDG PVLL PDNH 
GACCACTACCAG CAGAACACCCCC ATCGGCGACGGC CCCGTGCTGCTG CCCGACAACCAC 

VLST Q SAL SKDP N EKR DHMV 
TACCTGAGCACC CAGTCCGCCCTG AGCAAAGACCCC AACGAGAAGCGC GATCACATGGTC 

LLEF V TAA GITL GMDE LYK 
CTGCTGGAGTTC GTGACCGCCGCC GGGATCACTCTC GGCATGGACGAG CTGTACAAG 



ECPP : SEQ ID NO:49 (Nucleic acid); SEQ ID NO:50 (Amino acid) 

MVSK GEEL FTGV VPIL VE LD 
ATGGTGAGCAAG GGCGAGGAGCTG TTCACCGGGGTG GTGCCCATCCTG GTCGAGCTGGAC 

G. DVN GHK F SVSG EGEG DATY 
GGCGACGTAAAC GGCCACAAGTTC AGCGTGTCCGGC GAGGGCGAGGGC GATGCCACCTAC 

GKLT LKF I CT TG KLPV PWPT 
GGCAAGCTGACC CTGAAGTTCATC TGCACCACCGGC AAGCTGCCCGTG CCCTGGCCCACC 

LVTT LTWG V QCF S'R YP DHMK 
CTCGTGACCACC CTGACCTGGGGC GTGCAGTGCTTC AGCCGCTACCCC GACCACATGAAG 

Q HDF FKSA MPEG YVQE RTIF 
CAGCACGACTTC TTCAAGTCCGCC ATGCCCGAAGGC TACGTCCAGGAG CGCACCATCTTC 
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PKDD GNYK TRAE VKPE ODTL 
TTCAAGGACGAC GQCAACTACAAG ACCCGCGCCGAG GTGAAGTTCGAG GGCGACACCCTG 

VNRI ELKG IDFK EDGN ILGH 
GTGAACCGCATC GAGCTGAAGGGC ATCGACTTCAAG GAGQACGGCAAC ATCCTGGGGCAC 

KLEY NYIS HNVY ITAD KQKN 
AAGCTGGAGTAC AACTACATCAGC CACAACGTCTAT ATCACCGCCGAC AAGCAGAAGAAC 
GIKA KTF KI R HK I E DGS VQLA 
GGCATCAAGGCC AACTTGAAGATC CGCCACAACATC GAGGACGGCAGC GTGCAGCTCGCC 

DH YQ QNTP IGDG P VLL PDKTH 
.GACCACTACCAG CAGAACACCCCC ATCGGCGACGGC CCCGTGCTGCTG CCCGACAACCAC 

YLST QSAL SKDP NEKR D HMV 
TACCTGAGC:7i.CC CAGTCCGCCCTG AGCAAAGACCCC AACGAGAAGCGC GATCACATGGTC 

LLEF VTAA G ITL GMDE LYK 
CTGCTGGAGTTC GTGACCGGCGGC GGGATCACTCTC GGCATGGACGAG CTGTACAAG 



Fred25: SEQIDNO:51 (Nucleic acid); SEQ ID NO:52 (Amino acid) 

MASK GEEIi FTGV VPIL V EIiD 
ATGGCTAGCAAA GGAGAAGAACTC TTCACTGGAGTT GTCCCAATTCTT GTTGAATTAGAT 

GDVN GHKP S VSG EGEG DAT Y 
GGTGATGTTAAC GGCCACAAGTTC TCTGTCAGTGGA GAGGGTGAAGGT GATGCAACATAC 

GKLT li KFX CTTG JCLPV PWPT 
GGAAAACTTACC CTGAAGTTCATC TGCACTACTGGC AAACTGCCTGTT .CCATGGCCAACA 

LVTT LCY G VQCF S.RYP DHM K 
CTAGTCACTACT CTGTGCTATGGT GTTCAATGCTTT TCAAGATACCCG GATCATATGAAA 

RHDF FKS A MPEG YV QE RTIP 
CGGCATGACTTT TTCAAGAGTGCC ATGCCCGAAGGT TATGTACAGGAA AGGACCATCTTC 

FKDD GNYK TRAE VKPE GDTL- 
TTCAAAGATGAC GGCAACTACAAG ACACGTGCTGAA GTCAAGTTTGAA GGTGATACCCTT 

VNRI ELKG IDFK EDG N ILGH 
GTTAATAGAATC GAGTTAAAAGGT ATTGACTTCAAG GAAGATGGCAAC ATTCTGGGACAC 

KLEY NYNS HNVY IMAD KQKN 
AAATTGGAATAC AACTATAACTCA CACAATGTATAC ATCATGGCAGAC AAACAAAAGAAT 

GIKV NPKT RHN.I EDGS VQLA 
GGAATCAAAGTG AACTTCAAGACC CGCCACAACATT GAAGATGGAAGC GTTCAACTAGCA 

DHYQ QNT P IGDG PVLL PDNH 



0050872A2J_> 
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GACCATTATCAA CAAAMACTCCA ATTGGCGATGGG CCTGTCCTTTTA CCAGACAACCAT 

YLST QSAL SKDP KEKR DHMV 
TACCTGTCCACA CAATCTGCCCTT TCGAAAGATCCC AACGAAAAGAGA GACCACATGGTC 

LLEP V TA A G I TH G.MDE LYN* 
CTTCTTGAGTTT GTAACAGCTGCT ' GGGATTACACAT GGCATGGATGAA CTGTACAACTAG 
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2. PROTEASE RECOGMTION SITES 



Recognitions 


Source 


Recognition Site 


SbQ ID 
NO 


Reference 


Caspase-1,4^ 


peptide library 


5'(TGG.TTA)GA.ACATaACAA 
acq w , L |Cri u/ 


53 

J** 


Thombcrry ct aL, 1997, J. Biol. 
L^nem. z/z>i fv\jt 


proCaspasb-l 


peptide library 


5'TGGTTTAAAGAC 
AASeq. WFKD/ 


55 
56 


Thombcny el oL, 1997. X Biol. 
Chem. 272:17907 


C&spase-2 


peptide library 


5'GACGAACACGAC 
AA Seq: DEHD/ 


57 
58 


Thombcny ci aU, 1997, J. Biol. 
Chem. 272:17907 


Citspose 3, 7 


PARP 


j*gacgaagttgac 

AA Seq: DEVD/ 


59 
60 


Beneke, et at., 1997. Biochem 
Mol Biol Int. 43:755-61: 
Thomberry et al., 1997, J. Biol. 
Chem. 272:17907 


ProCaspasc 3 


Caspase-3 


5'ATAGAAACAGAC 
AA Seq: lETD/ 


61 
62 


Tewari. M., el at, 1995. . Cell. 
81:801-9. 


ProCaspase-4^ 


peptide library 


5TCGGTAAGAGAC 
AA Seq: WVRD/ 


63 
64 


Thomberry, N.A. etal., 1997; 
J.Biol. Chem. 272, 17907-179! 1 


Cospase 6 


Lamin A, 
peptide library 


5'GTAGAAATAGAC 
AA Seq: VEID/ 
5'GTAGAACACGAC 
AA Seq: VEHD/ 


65 
66 
67 
68 


Nak^jtina and Sado. 1993. 
Biochi m Biophys Acta. 1 1 7 1 :3 1 1 - 
4: Thomberry et al., 1997, J. Biol. 
Chem. 272:17907 


proCaspasc 6 


Caspase-6 


5*ACAGAAGTA0AC 
AA Seq: TEVD/ 


69 
70 


Femandes-Atncmrii et al., 1994. J 
Biol Chem. 269:30761-4. 


proCa5pa5c-7 


peptide library 


5'ATACAAGC^aAC 
AA Seq: IQAD/ 


71 
72 


Thombcny. N.A. el aU. 1997> 
J.Biol. Chem. 272, 17907-17911 


Caspase 8 


. peptide library 


5'GTAGAAACAGAC 
AA Seq: VETO/, 


73 
74 


Muzio* M., etat., 1996. Cell. 
85:817-27; Femandes-Alnemn\ et 
ah. 1996. Proc Natl Acad Set U S 
A. 93:7464-9;Thomberry ct al., 
1 997, J. BioK Chem. 272: 1 7907 


proC8spase-8 


Caspase-S 


5'TTAGAAACAOAC 
AA Seq: LETD/ 


75 
76 


Muzio, M., et al., 1996. Cell. 
85:817-27; Femandes-Alnemri, ct 
al.. 1996. Proc Natl Acad Sci U S 
A. 93:7464-9;ThombcrTy ct al., 
1997, J. Biol. Chem. 272:17907 


. Cospase 9 


peptide library 


5*TTAOAACACGAC 
AA Seq: LEHD/ 


77 
78 


Thombcny, N.A. et al., 1997, 
J.Biol. Chem. 272. 17907-1791 1 


proCaspasc 9* 


Caspa5e-9 


CCCGAACCCGAC 
PEPD 


79 
80 


Thombcny, N.A. el al., 1997, 
J.Biol. Chem. 272, 17907-1791 1 


HIV protease 




5*AGCCAAAATTAC 
AAScq:SQNY/ 

5'CCAATAGTACAA 
AA Sep: PIVQ/ 


81 
82 

83 
84 


Matayoshi. ct al, 1990. Science. 
247:954-8. 


Adenovirus 
endopeptidase 




5'AUGTTrGGAGGA 
AA Seq: MFGG/ 

5'GC AAAAAAAAGA 
AA Seq: AKKR/ 


85 
86 

87 

88 , 


Weber and Tihanyi. 1994. 
Methods Enzymol. 244:595-604. 


b-Secretase 


Amyloid 
precursor 
protein 


5'GTAAAAAUQ 
AA Seq. VKM/ 

S'OACGCAGAATTC 
DAEF/ 


89 
90 

91 

92 


Hardy et al., 1994. in Amyloid 
Protein Precursor in 
Development. Aging, and 
Alzheimer's Disease, cd. C.L. . 
Masters et al., pp. 1 90-t 98. 


Caihcpsin D 




5*AAACCA0CA1TATTC 
AA Seq: KPALF 

5'TTCAGATTA 
AA Seq: FRL/ 


93 
94 

95 
96 


Dunn, et al.. 1998. Adv Exp Med 
Biol. 436:133-8. 


Mnlrix 

Mctallo'protcascs 




5'GGACCATTAGGACCA 
AA ScqrGPLGP 


97 
98 


Bouvier et al., 1993; Gorbett el 
al.. 1999; Hill and Sakanari, 1997; 
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Kojima et 1998; Tyagi ei al., 
1995; Wilhelmetal., 1993; 
Williams and Auld, 1986; 
Hauglandt Handbook of 
fluorescent probes and research 
Chemicals 7th ed. 


Granzymc B 


peptide library 


5 ' ATAG A ACCAG AC 
AAScq:IEPD/ 


99 
100 


Thomberry et al., 1997, J. Biol. 
Chcm. 272:17907 


Anthrux protease 


MEICl 


5 • ATGCCC AAGAAOA AGCCG AC 
GCCCATCCAGCTGAACCC 

AA Seq: MPKKKPTPIOLN 


101 
102 


Vttale et al., (t 998) Biochem 
Biophys Res Commun 248 (3), 
706-71 1 


Anthrax proleasc 


MEO 


5'ATGCTGCCCCGGAGGAAGCCG 

GTGCTCCCGGCCCTCACCATCA 

ACCC 

A A Seq: MLARRKPVLPALTJN 


103 
104 


Vitale et al., (1 998) Biochem 
Biophys Res Commun 248 (3), 
706-71 1 


letanusAsotulinum 


cellubrevin 


5'GCCTCGCAGTTTGAAACA 
AA Seq: ASOFET 


105 
106 


McMahon et al., Nature 364 J46- 
349; Martin et at., J. Cell Biol. In 
press 


tetanus/botulinum 


synaptobrevin/ 
VAMPJ 


5 •OCTTCTCAATTTGAAACG 
AA Sea: ASOFET 


107 
108 


Schiavo et at., (1992) Nature 
359. 832-5 


Botulinum 
neurotoxin A 


SNAP-25 


5'GCCAACCAACGTGCAACA 
AA Seq: ANO/RAT 


109 
110 


Zhao, et al. Gene 145 (2), 3I3- 
314^994) 


Botulinum 
neurotoxin B 


VAMP 


5'GCTTCTCAATTTGAAAGG 
AA Seq: ASQ/FET 


ill 
112 




Botulinum 
neurotoxin C 


Syntaxth 


5'ACGAAAAAAGCTGTGAAA 
AA Seq: TKK/AVK 


113 
lU 


Martin et al., J. Leukoc. Biol. 65 
(3). 397^06 (1999) 


Botulinum 
neurotoxin D 


VAMP 


5*CACCAGAAGCTCTCTGAG 
AA Seq: DOK/LSE 


115 
116 




Botulinurn 
neurotoxin E 


SNAP-25 


5*ATCGACAGGATCATGGAG 
AA Seq: IDR/IME 


117 
118 




Botulinum 
neurotoxin F 


VAMP 


S^AGAOACCAGAAOCTCTCT 
AA Seq: RDO/KLS 


119 
120 




Botulinum 
neurotoxin G 


VAMP 


5 ' ACG AGCGCAGCCA AGTTG 
AA Seq: TSA/AKL 


121 
122 
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3. PRODUCT/REACTANT TARGET SEQUENCES 



Target 


Target Source 


Target domain (Product or Reactant) 


SEQ ID 
NO 


Reference 


Cytoptasm/cyios 
kcleton 


Annexin tl 


5'ATGTCTACTGTCCACGAAATCCTGTGCAAG 

CTCAGCrrGGAGCGTCTTCATTtTACACCCXr 

AAaTGCC3* 

(Amino acid seq: M S T V H E 1 L C K L S L 
EOVHSTPPSA) 


123 

124 ' 


Eberhaid, et al., 
1997. Mol. Biol. 
Cell 8:293a. 


inner surface of 

plasma 

membrane 


famesylation 


5'AUGGGATCTACATTAAGCGCAGAAGACAA. 
AGCAGCAGTAGAAAGAAOCAAAAUGATAGA 
CAOAAACTTATTAAGAGAAGACGGAaAAAA 
AGCTGCTAGA3' 

(AAseq: M 0 C T L S A E D K A A V E R 
SKMIDRNL'REDGEKAAR 


125 
126 


Femiccio G. et aL, 
J. BioK Chem.274, 
5843-5850. 1999 


Nucleus 


NFkB p50 


5'AGAAGGAAACGACAAAAG 
(AA scq: R R K R Q K) 


127 


Henkel.Tetal., 
Cell 68,1121- 
1133.1992 


Nucleolus 


NOLP 


5'AOAAAACGTATACGTACTTACCTCAACTCC 
TGCAGGCGQATOAAAAGAAGTGGTTTTGAOA 
TGTCTCGACCTATTCCTTCCCACCTrACT 

(AAseq: RKRIRTYLKSCRRMK 
RSGFEMSRPIPSHLT) 


129 
130 


Uelct, etal^ 1998. 
Biochem Biophys 
Res Commun. 
252:97-102. 


Mitochondria 


cytochrome c 
oxidase 


5'ATGTCCCTCCTGACGCCGCTGCTCKrrGCGG 
GGCTTGACAGGCTCGOCXrCGGCGGCTCCCAG 
TGCCXjCGCtjCCAAGATCrATrCGTTO 

(AASeq:MSVLTPLLLRGLTGS 
ARRLPVPRALIHSU 


131 

132 


Rizzuto, cl al., 
1989. J BiotChem. 
264:10595-600. 


Nuclear Envelope 


ODV.E66 & 
ODV-E25 


5'AUOAGCATTGTTTTAATAATTGTTATrTGGA 
riTl 1 J l AATATGTTTTTTATATTTAAGCAACA 

gcaaagatcccagagtacc:agttgaattaau 

G 

(AAScq:MSIVLMVIVVIFLICF 
LYLSNSKDPRVPVELM) 


133 
134 


Hong, T, ct al. 
PNAS. 94» 4050- 
4055, 1997 


Golgi 


Calreticulin 


5*ATGAGGCnTCGGGAGCCOCTCCTGAGCGGC 
AGCCCXGCOATGCCAGGCGCGTCCCTACAGC 
GGGC(rrGCCCCCTGCTCGTGGCCGT(rrGCGCT 
CTGCACCnTGGCGTCACCCTCGTrTACTACCT 

(JUL. i UUV^ifU^UA^^ I U/\UL«^ULfLi> * \J\^\,A^\^nA ■ 

CTG(TrCGQAGTCTCCACACC(3CrrGCAGGGCG 
GCTCGAACAGTGCCGCCOCCATCGGGCAQTC 
CTCCGOGGAGCTCCGGACCGGAGGOGCC 

(AA Seq: MRLREPLLSCSAAMP 
GASLQRACRLLVAVCALHLGVTL 
VYYLAGRDLSR.LPQLVGVSTPLQG 
GSNSAAAICOSSGELRTGGA) 


135 
136 


Fiiegel, L*, et al.,J. 
Biol. Chem. 264, 
21522 21528. 
1989. 


Endoplasmic 
reticulum 


D-AICAPI 


5'GAAACAATAAGACCrrATAAGAAGATGTAGT 

ACATTTACATCTACAGACAGCAAAAUGGCAA 

TTCAAITAAGATCTCCCTTTCCATTAGCATTA 

CCAGOAAUGTTACCTTTATTAGGATGGTGOt 

GGTTTTTCACTAGAAAAAAA 

(AA Scq:ETIRPI RI RRC6 YFTSTDS KM 

AIQLRSPFPLALPGMLALLGWWW 

FFSRKK 


137 
138 


Huang. U. Et al.. 
J. Cell. Biol. 145, 
951-959, 1999 


Nuclear Export 


MEK1 


5 ' GCCTTGCAGAAGAAGCTGGAGGAGCT . 
AGAGCTTGATGAG 


139 


Fukuda. (1997) 
J. Biol. Chem 
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(AA SEQ:A LQKKLEELE 
L D E 


140 


272. 51,32642- 
32648 


Size exclusion 


PROJ domain of 
MAP4 


S'GCCOACCTCAGTCTTGTGGATGCGTFGACA 
OAACCACCTCCAOAAATTGACGGAGAAATAA 
AGCGAGACTTCATGGCTGCCCTGGAGGCAQA 
GCCCTATGATGACATCGTGOGAGAAACTOTG 

TOATOAOAAAACCGGQAACTCAOAGTCCAAA 

AAG AAACCCTGCTTAGACACrAGCCAGGTTQ 

AAGGTATCCCATCTTCTAAACCAACACTCCTA 

GCCAATGGTGATCATGGAATGGAGGGGAATA 

ACACTCCAGGGTCTCCAACTOACTTCCTTGAA 

QAGAGAGTGGACTATCCGGATTATCAGAGCA 

GCCAGAACTCGCCAGAAGATGCAAGCTnTG 

TTTCCACrCTCAGCAAGTGTTAGATACTGACC 

AGGCTOAGCCCTTTAACOAGCACCGTGATGA 

TGCTTTGGCAOATCTGCTCTTTGTrrCCAGTG 

GACCCACGAACGCTTCTGCATTTACAGAGCO 

AGACAATCCTTCAGAAGACAGTrACGOTATO 

CTTCCCrCTGACTCATTTGCTrCCACCGCTGT 

TGTATCTCAGGAGTGGTCTGTGGGAGCCCCA 

AGAGGTTACTATAGAAACCCTACAGCCAGCA 

ACAGAGCTCTCCAAGGCAGCAGAAGTGGAAT 

CAGTGAAAGAGCAGCTGCCAGCTAAAOCATT 

GGAAACOATOOCAGAGCAaACCACTGATOTG 

GTGCACTCTCCATCCACAQACACAACACCAO 

GCCCAGACACAGAGGCAGCACTGGCTAAAGA 

CATAOAAGAQATCACCAAaCCAGATGTGATA 

TTGGCAAATGTCACGCAGCCATCTACTGAAT 

CGGATATGTTCCTCGCCCAGGACATCGAACT 

ACTCACAGGAACAGAGGCAGCCCACGCTAAC 

AATATCATATTGCCTACAGAACCAGAeGAAT 

CTTCAACCAAGGATGTAGCACCACCTATCGA 

AGAAGAAATTGTCCCAGGCAATGATA 

(AASEQ: ADLSLVDALTEPPPEtEQEI 
KRDFMAALEAEPYDDJ VGETVEICT 
EFIPLLDGDEKTGNSESKKKPCLD 
TSQVEGIPSSKPTLLANGDHGMEG 
NNTAOSPTDFLEERVDYPDYQSS 
QNWPEDASFCFQPQQ VLDTDQAB 
PFNEHRDDQL ADLLFVSSGPTNAS 
AFTERDNPSEDSYGMUPCDSFAST 
A VVSQEWS VG APNSPCSESC VSP 

LPAKALETM AEQTTDVVHSPSTDT 
TPGPDTEAALAKDIEEITKPDVILA 
. NVTQPSTESDMFLAQDMELLTGTE 
AAHANNIILPTEPDESSTKDVAPPM 
EEEIVPGNDTTSPICETETTLPIICMD 
LAPPEDVLLTKETELA P AKGM VSL 
SEIEEALA ICN.D VRSAEIPVAOETV 
VSETEVVLATE VVLPSDPITTLTK. 
DVTLPtEAERPLVTDMTPSLETEM 
TLGKETAPPTETNLGMAKDMSPLP 
ESEVTLGKDVVILPETKVAEFNNV 
TPLSEEEVTSVK.DMSPSAETEAPL 
AKNADLHSGTELIVDNSMAPASDL 
ALPLETKVATVPIKDKG 


141 
142 


West. (1991). J 
Bloi Chem 
266(32): 218B6- 
96: Olson, K. R. 
(1995). J Cell 
Biol 130(3): 639- 
50. 


Vesicle 
membrane 


Synaptobrevin 


5 ' ATGTGGGCAA.TCGGGATTACTGTTCT 
GGTTATCTTCATCATCATCATCATCGTG 
TGQGTTGTC 

(AA SEO: MWAIGITVLV 
IFIIIIIVWVV) 


143 
144 


Schtavo et al., 
(1992) Nature 
359. 632-5 
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Vesicle 
(TismDrane 


Celtubrevin 


5 ' ATGTGGGCGATAGGGATCAGTGTCCT 
GGTGATCATTGTCATCATCATCATCGTG 
TGGTGTG 

(AA SEQ: MWAIGISVLV 
IIVIIIIVWC) 


145 
146 


McMahon et al.. 
' Nature od4 1*540-' 
349: Martir) et al., 
J. Celt Biol. In 
press 


Nuclear Export 


MEK2 


5 ' GACCTGCAGAAGAAGCTGGAGGAGCT 
QGAACTTGACGAG 

AA SEQ: D L Q K K L E E L E L D E 


147 
148 


Zheng and Guan, 
J. Biol. Chcm. 
268:11435-11439, 
1993 


Peroxisome 


PX 


5 ' TCTAAACTG 
AA SEQ: S K L 


149 
150 


Amery et al., 
Btochem. J. 
336:367-371 
0998) 













Microtubules (MAP4) SEQ ID NO:151 (Nucleic acid); SEQ ID NO:152 (amino acid) 



MAP4 : 

MADL SLVD AL T.E PPPE lEGE 
ATGGCCGACCTC AGTCTTGTGGAT QCGTTGACAGAA CCACCTCCAQAA ATTGAGGGAGAA 
TACCGGCTGGAG TCAGAACACCTA CGCAACTGTCTT -GGTGGAGGTCTT TAACTCCCTCTT 

IK.RD FMAA LEAE P Y DD IVGE 
ATAAAGCGAGAC TTCATGGCTGCG CTGGAGGCAGAG CCCTATGATGAC ATCGTGGGAGAA 
TATTTCGCTCTG AAGTACCGACGC GACCTCCGTCTC GGGATACTACTG TAGCACCCTCTT 

TVEK TEFI PLLD GDEK TGNS 
ACTGTGGAGAAA ACTGAGTTTATT CCTCTCCTGGAT GGTGATGAGAAA ACCGGGAACTCA 
TGACACCTCTTT TGACTCAAATAA GGAGAGGACCTA CCACTACTCTTT TGGCCCTTGAGT 

ESK K KPCL DTSQ VEGI PSSK 
GAGTCCAAAAAG AAACCCTGCTTA GACACTAGCCAG GTTGAAGGTATC CCATCTTCTAAA 
CTCAGGTTTTTC TTTGGGACGAAT CTGTGATCGGTC CAACTTCCATAG GGTAGAAGATTT 

P T £ L . A. N G D H G M E G N N T . A G S P 
CCAACACTCCTA GCCAATOGTGAT CATGGAATGGAG GGGAATAACACT GCAGGGTCTCCA 
GGTTGTGAGGAT CGGTTACCACTA GTACCTTACCTC CCCTTATTGTGA CGTCCCAGAGGT 

TDFL E ERV DYPD YQSS QNWP 
ACTGACTTCCTT GAAGAGAGAGTG GACTATCCGGAT TATCAGAGCAGC CAGAACTGGCCA 
TGACTGAAGGAA CTTCTCTCTCAC CTGATAGGCCTA ATAGTCTCGTCG GTCTTGACCGGT 

EDAS PCFQ PQQV L DTD QABP 
GAAGATGCAAGC TTTTGTTTCCAQ CCTCAGCAAGTG TTAGATACTGAC CAGGCTGAGCCC 
CTTCTACGTTCG AAAACAAAGGTC GGAGTCGTTCAC AATCTATGACTG GTCCGACTCGGQ 

FNEH RDDG IiADL LFVS SG.PT 
TTTAACGAGCAC CGTGATGATGGT TTGGCAGATCTG CTCTTTGTCTCC AGTGGACCCACG 
AAATTGCTCGTG GCACTACTACCA AACCGTCTAGAC GAGAAACAGAGG TCACCTGGGTGC 

NASA FTE R DNPS EDSY GM LP 
AACGCTTCTGCA TTTACAGAGCGA ' GACAATCCTTCA GAAGACAGTTAC GGTATGCTTCCC 
TTGCGAAGACGT AAATGTCTCGCT CTGTTAGGAAGT CTTCTGTCAATG CCATACGAAGGG 
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CDS P ASTA VVS Q EWSV GAPN 
TGTGACTCATTT GCTTCCACGGCT GTTGTATCTCAG GAGTGGTCTGTG GGAGCCCCAAAC 
ACACTGAGTAAA CGAAGGTGCCGA CAACATAGAGTC CTCACCAGACAC CCTCGGGGTTTQ 

SPCS ESCV SPE V TIE T LQPA 
TCTCCATGTTCA GAGTCCTGTGTC TCCCCAGAGGTT ACTATAGAAACC CTACAGCCAGCA 
AGAGGTACAAGT CTCAGGACACAG AGGGGTCTCCAA TGATATCTTTGG GATGTCGGTCGT 

TELS KAA B VE SV KEQL PAKA 
ACAGAGCTCTCC AAGGCAGCAGAA GTGGAATCAGTG AAAGAGCAGCTG CCAGCTAAAGCA 
TGTCTCGAGAGG TTCCGTCGTCTT CACCTTAGTCAC TTTCTCGTCGAC GGTCGATTTCGT 

LETM AEQT TDVV HSPS T DTT 
TTGGAAACGATG GCAGAGCAGACC ACTGATGTGGTG CACTCTCCATCC ACAGACACAACA 
AACCTTTGCTAC CGTCTCGTCTGG TGACTACACCAC GTGAGAGGTAGG TGTCTGTGTTGT 

PGP D TEAA LAKD lEEI TKPD 
CCAGGCCCAGAC ACAGAGGCAGCA CTGGCTAAAGAC ATAGAAGAGATC ACCAAGCCAGAT 
GGTCCGGGTCTG TGTCTCCGTCGT GACCGATTTCTG TATCTTCTCTAG TGGTTCGGTCTA 

V ILA NVTQ PSTE SDM P LAQD 
GTGATATTGGCA AATGTCACQCAG CCATCTACTGTIA TCGGATATGTTC CTGGCCCAGGAC 
CACTATAACCGT TTACAGTGCGTC GGTAGATGACTT AGCCTATACAAG GACCGGGTCCTG 

MEIili TGTE AAHA NN I I L P'tE 
ATGGAACTACTC ACAGGAACAGAG GCAGCCCACGCT AACAATATCATA TTGCCTACAGAA 
'I'ACCTTGATGAG* TGTCCTTGTCTC CGTCGGGTGCGA TTGTTATAGTAT AACGGATGTCTT 

PDES STKD VAPP'MEEE IVPG 
CCAGACGAATCT TCAACCAAGGAT GTAGCACCACCT ATGGAAGAAGAA ATTGTCCCAGGC 
GGTCTGCTTAGA AGTTGGTTCCTA CATCGTGGTGGA .TACCTTCTTCTT TAACAGGGTCCG 

NDT T SPKE TETT LPIK MDIiA 
AATGATACGACA TCCCCCAAAGAA ACAGAGACAACA CTTCCAATAAAA ATGGACTTGGCA 
TTACTATGCTGT AGGGGGTTTCTT TGTCTCTGTTGT GAAGGTTATTTT TACCTGAACCGT 

PPED VLLT KETE L A PA KGMV 
CCACCTGAGGAT GTGTTACTTACC AAAGAAACAGAA CTAGCCCCAGCC AAGGGCATGGTT 
GGTGGACTCCTA CACAATGAATGG TTTCTTTGTCTT GATCGGGGTCGG TTCCCGTACCAA 

SLSE l EEA LAKN DVRS A EIP 
TCACTCTCAGAA ATAGAAGAGGCT CTGGCAAAGAAT GATGTTCGCTCT GCAGAAATACCT 
AGTGAGAGTCTT TATCTTCTCCGA GACCGTTTCTTA CTACAAGCGAGA CGTCTTTATGGA 

VAQE TVVS ETE V VLAT E VVL 
GTGGCTCAGGAG ACAGTGGTCTCA GAAACAGAGGTG GTCCTGGCAACA GAAGTGGTACTG 
CACCGAGTCCTC TGTCACCAGAGT CTTTGTCTCCAC CAGGACCGTTGT CTTCACCATGAC 

PSDP ITTL TKDV TLPL E 'AER 
CCCTCAGATCCC ATAACAACATTG ACAAAGGATGTG ACACTCCGCTTA GAAGCAGAGAGA 
GGGAGTCTAGGG TATTGTTGTAAC TGTTTCCTACAC TGTGAGGGGAAT CTTCGTCTCTCT 
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PLVT DMTP SliET EMTL GKET 
CCGTTGGTGACG • GACATGACTCCA TCTCTGGAAACA GAAATGACCCTA GGCAAAGAGACA 
GGCAACCACTGC CTGTACTGAGGT AGAGACCTTTGT CTTTACTGGGAT CCGTTTCTCTGT 

APPT ETNL GMAK D MSP liPES 
GCTCCACCCACA GAAACAAATTTG GGCATGGCCAAA GACATGTCTCCA CTCCCAGAATCA 
CGAGGTGGGTGT CTTTGTTTAAAC CCGTACCGGTTT CTGTACAGAGGT GAGGGTCTTAGT 

EVT L GKDV VI LP ETKV AEFN 
GAAGTGACTCTG GGCAAGGACGTG GTTATACTTCCA GAAACAAAGGTG GCTGAGTTTAAC 
CTTCACTGAGAC CCGTTCCTGCAC CAATATGAAGGT CTTTGTTTCCAC CGACTCAAATTG 

NVTP LS'EB E VT S VKDM SPSA 
AATGTGACTCCA CTTTCAGAAGAA GAGGTAACCTCA GTCAAGGACATG TCTCCGTCTGCA 
TTACACTGAGGT GAAAGTCTTCTT CTCCATTGGAGT CAGTTCCTGTAC AGAGGCAGACGT 

ETEA PLAK NADL H SGT ELIV 
GAAACAGAGGCT CCCCTGGCTAAG AATGCTGATCTG jCACTCAGGAACA GAGCTGATTGTG 
CTTTGTCTCCGA GGGGACCGATTC TTACGACTAGAC GTGAGTCCTTGT CTCGACTAACAC 

DNSM APAS DLAL PLET .KVAT 
GACAACAGCATG GCTCCAGCCTCC GATCTTGCACTG CCCTTGGAAACA AAAGTAGCAACA 
CTGTTGTCGTAC CGAGGTCGGAGG CTAGAACGTGAC GGGAACCTTTGT TTTCATCGTTGT 

vpik'dkg't vq te e.kpr edsq 

GTTCqAATTAAA GACAAAGGAACT GTACAGACTGAA GAAAAACCACGT GAAGACTCCCAG 
CAAGGTTAATTT CTGTTTCCTTGA CATGTCTGACTT CTTTTTGGTGCA CTTCTGAGGQTC 

LASM QHKG.QSTV PPCT ASPE 
TTAGCATCTATG CAGCACAAGGGA CAGTCAACAGTA CCTCCTTGCACG GCTTCACCAGAA 
AATCGTAGATAC GTCGTGTTCCCT GTCAGTTGTCAT 'gGAGGAACGTGC CGAAGTGGTCTT 

PVK A AEQM STLP IDAP S PLE 
CCAGTCAAAGCT GCAGAACAAATG TCTACCTTACCA ATAGATGCACCT TCTCCATTAGAG 
GGTCAGTTTCGA CGTCTTGTTTAC AGATGGAATGGT TATCTACGTGGA AGAGGTAATCTC 

NLEQ KETP GSQP SE PC SGVS 
AACTTAGAGCAG AAGGAAACGCCT GGCAGCCAGCCT TCTGAGCCTTGC TCAGGAGTATCC . 
TTGAATCTCGTC TTCCTTTGCGGA CCGTCGGTCGGA AGACTCGGAACG AGTCCTCATAGG 

RQEE AK AA VG-V T GNDI TTPP 
CGGCAAGAAGAA GCAAAGGCTGCT GTAGGTGTGACT GGAAATGACATC ACTACCCCGCCA 
GCCGTTCTTCTT CGTTTCCGACGA CATCCACACTGA CCTTTACTGTAG TGATGGGGCGGT 

NKEP PPSP EKK. A KPLA TTQP 
AACAAGGAGCCA CCACCAAGCCCA GAAAAGAAAGCA AAGCCTTTGGCC ACCACTCAACCT 
TTGTTCCTCGGT GGTGGTTCGGGT CTTTTCTTTCGT TTCGGAAACCGG TGGTGAGTTGGA 

A K T S TSKA KTQP' TSL.P K Q PA 
GCAAAGACTTCA ACATCGAAAGCC AAAACACAGCCC ACTTCTCTCCCT AAGCAACCAGCT 
CGTTTCTGAAGT TGTAGCTTTCGG TTTTGTGTCGGG TGAAGAGAGGGA TTCGTTGGTCGA 
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PTTS GGLW KKPM SLAS GSV P 
CCCACCACCTCT GGTGGGTTGAAT AAAAAACCCATG AGCCTCGCCTCA GGCTCAGTGCCA 
GGGTGGTGGAGA CCACCCAACTTA TTTTTTGGGTAC TCGGAGCGGAGT CCGAGTCACGGT 

AAPH KRP A AATA TA RP STIi P 
GCTGCCCCACAC AAACGCCCTGCT GCTGCCACTGCT ACTGCCAGGCCT TCCACCCTACCT 
CGACGGGGTGTG TTTGCGGGACGA CGACGGTGACGA TGACGGTCCGGA AGGTGGGATGGA 

ARD V KPKP I tEA KVAE K.RTS 
GCCAGAGACGTG AAGCCAAAGCCA ATTACAGAAGCT AAGGTTGCCGAA AAGCGGACCTCT 
CGGTCTCTGCAC TTCGGTTTCGGT TAATGTCTTCGA TTCCAACGGCTT TTCGCCTGGAGA 

PSKP SSA P AliK P GPK T TPTV 
CCATCCAAGCCT TCATCTGCCCCA GCCCTCAAACCT GGACCTAAAACC ACCCCAACCGTT 
GGTAGGTTCGGA AGTAGACGGGGT CGGGAGTTTGGA CCTGGATTTTGG TGGGGTTGGCAA 

SKA T SP 'ST LVST GPSS RSPA 
TCAAAAGCCACA TCTCCCTCAACT CTTGTTTCCACT GGACCAAGTAGT AGAAGTCCAGCT 
AGTTTTCGGTGT AGAGGGAGTTGA GAACAAAGGTGA CCTGGTTCATCA TCTTCAGGTCGA 

TTLP KRPT S I KT EGK P ADVK 
ACAACTCTGCCT AAGAGGCCAACC AGCATCAAGACT GAGGGGAAACCT GCTGATGTCAAA 
TGTTGAGACGGA TTCTCCGGTTGG TCGTAGTTCTGA CTCCCCTTTGGA CGACTACAGTTT 

RMTA KSA S AD LS RSKT TSAS 
AGGATGACTGCT AAGTCTGCCTCA GCTGACTTGAGT CGCTGAAAGACC ACCfCTGCCAGT 
TCCTACTGACGA TTCAGACGGAGT CGACTGAACTCA GCGAGTTTCTGG TGGAGACGGTCA 

SVKR NTTP TGAA PPAG MTS T 
TCTGTGAAGAGA AACACCACTCCC ACTGGGGCAGCA CCCCCAGCAGGG ATGACTTCCACT 
AGACACTTCTCT TTGTGGTGAGGG TGACCCCGTCGT GGGGGTCGTCCC ' TACTGAAGGTGA 

RVKP M SAP SRSS .GALS VDKK 
CGAGTCAAGCCC ATGTCTGCACCT AGCCGCTCTTCT GGGGCTCTTTCT GTGGACTIAGT^G 
GCTCAGTTCGGG TACAGACGTGGA TCGGCGAGAAGA CCCCGAGAAAGA CACCTGTTCTTC 

PTST KPSS SA PR VSRIj ATTV 
CCCACTTCCACT AAGCCTAGCTCC TCTGCTCCCAGG GTGAGCCGCCTG GCCACAACTGTT 
GGGTGAAGGTGA TTCGGATCGAGG AGACGAGGGTCC CACTCGGCGGAC CQGTGTTGACAA 

SAPD LKSV RSK V GSTE NIKH 

tctgcccctgac ctgaagagtgtt cgctccaaggtc ggctctacagaa aacatc/^aacac 
agacggggactg gacttctcacaa GCGAGGTTCCAG ccgagatgtctt ttgtagtttgtg 

QPGG GRAK VEKK TEAA TTA G 
CAGCCTGGAGGA GGCCGGGCCAAA GTAGAGAAAAAA ACAGAGGCAGCT ACCACAGCTGGG 
GTCGGACCTCCT CCGGCCCGGTTT CATCTCTTTTTT TGTCTCCGTCGA TGGTGTCGACCC 

KPEP N AVT KAAG S.IAS AQ KP 
AAGCCTGAACCT AATGCAGTCACT AAAGCAGCCGGC TCCATTGCGAGT QCACAGAAACCG 
TTCGGACTTGGA TTACGTCAGTGA TTTCGTCGGCCG AGGTAACGCTCA CGTGTCTTTGGC 

PAGK V'.QIV S KKV S'YSH I QS K 
CCTGCTGGGAAA GTCCAGATAGTA TCCAAAAAAGTG AGCraCAGTCAT ATTCAATCCAAG 
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GGACGACCCTTT CAGGTCTATCAT AGGTTTTTTCAC TCGATGTCAGTA TAAGTTAGGTTC 

CVSK DNIK HVPG CGNV QIQ N 
TGTGTTTCCAAG GACAATATTAAG CATGTCCCTGGA TGTGGCAATGTT CAGATTCAGAAC 
ACACAAAGGTTC CTGTTATAATTC GTACAGGGACCT ACACCGTTACAA GTCTAAGTCTTG 

KK VD ISKV SSKC GSKA NI KH 
AAGAAAGTGGAC ATATCCAAGGTC TCCTCCAAGTGT GGGTCCAAAGCT AATATCAAGCAC 
TTCTTTCACCTG TATAGGTTCCAG AGGAGGTTCACA CCCAGGTTTCGA TTATAGTTCGTG 

KPG G GDV K lES Q KLNF KEKA 
AAGCCTGGTGGA GGAGA^GTCAAG ATTGAAAGTCAG AAGTTGAACTTC AAGGAGAAGGCC 
TTCGGACCACCT CCTCTACAGTTC TAACTTTCAGTC TTCAACTTGAAG TTCCTCTTCCGG 

QAKV GSLD NVGH FPAG GAVK 
CAAGCCAAAGTG GGATCCCTTGAT AACGTTGGCCAC TTTCCTGCAGGA GGTGCCGTGAAG 
GTTCGGTTTCAC CCTAGGGAACTA TTGCAACCGGTG - AAAGGACGTCCT CCACGGGACTTC 

TEGG GS EA LPCP GPPA 6EEP 
ACTGAGGGCGGT GGCAGTGAGGCC CTTCCGTGTCCA GGCCCCCCCGCT GGGGAG6AGCCA 
TGACTCCCGCCA CCGTCACTCCGG GAAGGCACAGGT CCGGGGGGGCGA CCCCTCCTCGGT 

VI P E A AP D RGAP TSAS GLSQ" 
GTCATCCCTGAG GCTGCGCCTGAC CGTGGCGCCCCT ACTTCAGCCAGT GGCCTCAGTGGC 
CAGTAGGGACTC CGACGCGGACTG GCACCGCGGGGA TGAAGTCGGTCA CCGGAGTCACCG 

HTTIi SGGG DQ R E p'qT Ij D^SQI 
CACACCACCCTG TCAGGGGGTGGT GACCAAAGGGAG CCCCAGACCTTG GACAGCCAGATC' 
GTGTGGTGGGAC AGTCCCCCACCA CTGGTTTCCCTC GGGGTCTGGAAC CTGTCGGTCTAG 

Q E T S I * 
CAGGAGACAAGC ATCTAA 
GTCCTCTGTTCG TAGATT 
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Fig* 50.^ General design b^bib 
azid product containing separate targeting and signal 
sequences . Bottom: Specific example of tbis Approach— 
Caspase 3 biosensor withjreactant targeted to cytoskeleton 
and product targeted to nucleus. . 
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* * * 

NFKB pp38 

Fig, 36 Dual-labeling assay in tH>o cell types with 3 drugs and 3 drug combinations. 
Treatments marked with an asterisk are different from controls at a 99% confidence 
level (p< 0.01). 
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SEQUENCE LISTING 

<110> Giuliano, Kenneth A. 
Kapur , Ravi 

<120> A System for Cell Based Screening 

<130> 97-022-L 

<14 0> To Be Assigned 
<141> Filed Herewith 

<160> 180 

<170> Patentin Ver. 2.0 

<210> 1 
<211> 1770 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (882) 

<:220> 

<223> Description of Artificial Secjuence : 
GFP-DEVD-Annexin II construct 

<400> a 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

15 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Vkl Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gee acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 . 40 45 

tgc acc ace ggc aag ctg ccc gtg cec tgg ece acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg acc tac ggc gtg cag tgc ttc age cgc tac ccc gac eac atg aag 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gee atg ccc gaa ggc tac gtc cag gag 2 88 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gae ggc aac tac aag ace cgc gee gag 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 . 
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ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 4 32 
lie Asp Phe Lys Glu Asp Qly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age eac aac gtc tat ate atg gcc gac aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 ISO 155 ISO 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 52 8 
Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

gtg cag etc gcc gac cac tac cag cag aac acc ccc ate ggc gac ggc 57 6 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gcc ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gcc ggg. ate act etc ggc atg gac gag ctg tac aag tec 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 . 240 

gga etc aga tct ggc gcc ggc get gga gcc gga get ggc gcc gga gee 768 
Gly Leu Arg Ser Gly Ala Gly Ala Gly Ala Gly Ala Gly Ala Gly Ala 
245 250 255 

gac gag gtg gac ggc gcc ggc gcc gat gaa gta gat ggc gcc atg tct 816 
Asp Glu Val Asp Gly Ala Gly Ala Asp Glu Val Asp Gly Ala Met Ser 
260 265 270 

act gtc cac gaa ate ctg tgc aag etc age ttg gag ggt gat cat tct 864 
Thr Val His Glu He Leu Cys Lys Leu Ser Leu Glu Gly Asp His Ser 
275 280 285 

aca ccc cca agt gcc tat tgaatggtga gcaagggcga ggagctgtte 912 
Thr Pro Pro Ser Ala Tyr 
290 

aceggggtgg tgcecatect ggtcgagetg gacggcgacg taaacggcea caagttcage 972 

gtgtccggcg agggcgaggg cgatgccacc tacggcaage tgaccctgaa gttcatctgc 1032 

accaceggea agetgeecgt gccctggcce aecctcgtga ecaccetgae etacggcgtg 1092 

cagtgettea gccgctaccc egaccacatg aagcagcacg acttctteaa gtccgccatg 1152 

cccgaagget acgtceagga gcgeaccate ttcttcaagg acgacggeaa etacaagacc 1212 

cgegcegagg tgaagttcga gggcgacace . ctggtgaacc geatcgagct gaagggcatc 1272 

gactteaagg aggacggcaa catcctgggg cacaagctgg agtacaaeta caacagccac 1332 

aacgtctata teatggcega caagcagaag aacggcatca aggtgaaett caagatccgc 13 92 
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cacaacatcg aggacggcag cgtgcagctc gccgaccact accagcagaa cacccccatc 1452 

ggcgacggcc ccgtgctgct gcccgacaac cactacctga gcacccagtc cgccctgagc 1512 

aaagacccca acgagaagcg cgatcacatg gtcctgctgg agttcgtgac cgccgccggg 1572 

atcactctcg gcatggacga gctgtacaag tccggactca gatctggcgc cggcgctgga 1S32 

gccggagctg gcgccggagc cgacgaggtg gacggcgccg gcgccgatga agtagatggc 1692 

gccatgtcta ctgtccacga aatcctgtgc aagctcagct tggagggtga tcattctaca 1752 

cccccaagtg cctattga 1770 

<210> 2 

<:211> 294 

<:212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
GFP-DEVD-Annexin II construct 

<400> 2 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 * 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 ^ 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 BO 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 ISO 155 160 

Gly lie Lys Val Asn Phe Lys He Arg His Asn He Glu Asp .Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 
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Pro Val Leu Leu Pro 
195 

Ser Lys Asp Pro Aan 
210 

Val Thr Ala Ala Gly 
225 

Gly Leu Arg Ser Gly 
245 

Asp Glu Val Asp Gly 
260 

Thr Val His Glu lie 
275 

Thr Pro Pro Ser Ala 
290 



Asp Asn His Tyr Leu Ser 
200 

Glu Lys Arg Asp His Met 
215 

lie Thr Leu Gly Met Asp 
230 235 

Ala Gly Ala Gly Ala Gly 
250 

Ala Gly Ala Asp Glu Val 
265 

Leu Cys Lys Leu Ser Leu 
260 

Tyr 



Thr Gin Ser Ala Leu 
205 

Val Leu Leu Glu Phe 
220 

Glu Leu Tyr Lys Sex 
240 

Ala Gly Ala Gly Ala 
255 

Asp Gly Ala Met Ser* 
270 

Glu Gly Asp His Ser 
285 



<210> 3 - 

<211> 2439 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . - <2436) 

<220> 

<223> Description of Artificial Sequence: 
EYFP-DEVD-MAPKDM construct 

<400> 3 ^ / 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

tgc acc ace ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



ttc ggc tac ggc ctg cag tgc ttc gcc cgc tac ccc gac cac atg aag 240 
Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 3 36 
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528 



576 



624 



Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gcc gac aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gcc gac cac .tac cag cag aac ace ecc ate ggc gac ggc 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ecc gac aac cac tac ctg age tac cag tec gcc ctg 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag aag 72 0 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Lys 
225 230 235 240 

gga gac gaa gtg gac/gga gcc gac etc agt ctt gtg gat gcg ttg aca 
Gly Asp Glu Val Asp Gly Ala /Asp Leu Ser Leu Val Asp Ala Leu Thr 
245 250 255 

gaa cca cct cca gaa att gag gga gaa ata aag cga gac ttc atg get 
Glu Pro Pro Pro Glu He Glu Gly Glu He Lys Arg Asp Phe Met Ala 
260 265 * 270 

gcg ctg gag gca gag ccc tat gat gac ate gtg gga gaa act gtg gag 
Ala Leu Glu Ala Glu Pro Tyr Asp Asp He Val Gly Glu Thr Val Glu 
275 280 285 

aaa act gag ttt att cct etc ctg gat ggt gat gag aaa ace ggg aac 912 
Lys Thr Glu Phe He Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn 
290 295 300 

tea gag tec aaa aag aaa ccc tgc tta gac act age cag gtt gaa ggt 960 
Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gin Val Glu Gly 
305 310 315 320 

ate cca tet tct aaa cca aca etc eta gee aat ggt gat cat gga atg 1008 
He Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met 
325 330 335 



768 



816 



864 



gag ggg aat aac act gca ggg tct cca act gac ttc ctt gaa gag aga 
Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg 



1056 
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340 345 350 

gtg gac tat ccg gat tat cag age age cag aac tgg cca gaa gat gca 1104 

Val Asp Tyr Pro Asp Tyx Gin Ser Ser Gin Asn Trp Pro Glu Asp Ala 

355 360 365 

age ttt tgt ttc cag cct cag caa gtg tta gat act gac cag get gag 1152 

Ser Phe Cys Phe Gin Pro Gin Gin Val Leu Asp Thr Asp Gin Ala Glu 

370 375 380 



ccc ttt aac gag cac cgt gat gat ggt ttg gca gat ctg etc ttt gtc 
Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val 
385 390 395 400 



tea gaa gac agt tac ggt atg ctt ccc tgt gac tea ttt get tec acg 

Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr 
420 425 430 

get gtt gta tct cag gag tgg tct gtg gga gee eca aac tct cca tgt 

Ala Val Val Ser Gin Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys 

435 440 445 



gca aca gag etc tec aag gca gca gaa gtg gaa tea gtg aaa gag cag 
Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gin 

470 475 480 



465 



gac atg gaa eta etc aca gga aca gag gca gee cac get aac aat ate 
Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala His Ala Asn Asn He 
545 550 555 560 

ata ttg cct aca gaa cca gac gaa tct tea aec aag gat gta gca cca 
He Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr Lys Asp Val Ala Pro 
565 570 575 



1200 



tec agt gga ccc acg aac get tct gca ttt aca gag cga gac aat cct 1248 
Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro 
405 410 415 



1296 



1344 



tea gag tec tgt gtc tec cca gag gtt act ata gaa ace eta cag cca 1392 
Ser Glu Ser Cys Val Ser Pro Glu Val Thr He Glu Thr Leu Gin Pro 
450 455 460 



1440 



Ctg eca get aaa gca ttg gaa acg atg gca gag cag aec act gat gtg 1488 

Leu Pro Ala Lys Ala L^u Glu Thr Met Ala Glu Gin Thr Thr Asp Val 
485 490 495 

gtg cac tct eca tee aca gac aca aca cca ggc cca gac aca gag gca 1536 

Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala 
500 505 510 

gca ctg get aaa gac ata gaa gag ate ace aag cca gat gtg ata ttg 1584 

Ala Leu Ala Lys Asp He Glu Glu He Thr Lys Pro Asp Val He Leu 

515 520 525 

gca aat gtc acg cag cca tct act gaa teg gat atg tte ctg gee cag 1632 

Ala Asn Val Thr Gin Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gin 
530 535 540 



1680 



1728 



cct atg gaa gaa gaa att gtc cca ggc aat gat acg aca tee ccc aaa 1776 
Pro Met Glu Glu Glu He Val Pro Gly Asn Asp Thr Thr Ser Pro Lys 
580 585 590 



6 



wo 00/50872 PCTAJSOO/04794 



gaa aca gag aca aca ctt cca ata aaa atg gac ttg gca cca cct gag 
Glu Thr Glu Thr Thr Leu Pro He Lys Met Asp Leu Ala Pro Pro Glu 
595 600 605 



gtt tea etc tea gaa ata gaa gag get etg gea aag aat gat gtt cgc 
Val Ser Leu Ser Glu He Glu Glu Ala Leu Ala Lys Asn Asp Val Arg 

625 630 635 640 

tet gca gaa ata eet gtg get cag gag aca gtg gte tea gaa aea gag 

Ser Ala Glu He Pro Val Ala Gin Glu Thr Val Val Ser Glu Thr Glu 

645 650 655 

gtg gte ctg gea aea gaa gtg gta etg ccc tea gat ecc ata aea aca 

Val Val Leu Ala Thr Glu Val Val Leu Pro Ser Asp Pro He Thr Thr 

660 665 670 

ttg aca aag gat gtg aca etc cec tta gaa gca gag aga ccg ttg gtg 

Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val 

675 680 685 



cca etc cca gaa tea gaa gtg act ctg ggc aag gac gtg gtt ata ctt 
Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys Asp Val Val He Leu 
725 ^ 730 735 



gaa gag gta acc tea gte aag gac atg tct ccg tet gca gaa aca gag 
Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu 
755 760 765 



gtg gac aac age atg get cca gee tec gat ctt gca ctg cec ttg gaa 
Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu 
785 790 795 80O 



1824 



gat gtg tta ctt acc aaa gaa aca gaa eta gee cca gee aag ggc atg 1B72 
Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met 
610 615 620 



1920 



1968 



2016 



2064 



acg gac atg act cca tet ctg gaa aca gaa atg acc eta ggc aaa gag 2112 
Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met Thr Leu Gly Lys Glu 
690 695 700 

aca get cca cec aca gaa aca aat ttg ggc atg gee aaa gac atg tct 2160 
Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser 
705 710 715 720 



2208 



cca gaa aca aag gtg get gag ttt aac aat gtg act cca ctt tea gaa 2256 
Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val Thr Pro Leu Ser Glu 
740 745 750 



2304 



get cec ctg get aag aat get gat ctg cac tea gga aca gag ctg att 2352 
Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Gly Thr Glu Leu He 
770 775 780 



2400 



aca aaa gta gca aca gtt cca att aaa gac aaa gga tga 243 9 

Thr Lys Val Ala Thr Val Pro He Lys Asp Lys Gly 
805 810 



<210> 4 
<211> 812 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
EYFP-DEVD-MAPKDM construct 

<400> 4 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 ^ 155 160 

Gly He Lys Val Asn Phe Lys He Arg Hi^ Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro. He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Lys 
225 230 235 240 

Gly Asp Glu Val Asp Gly Ala Asp Leu Ser Leu Val Asp Ala Leu Thr 
245 250 255 

Glu Pro Pro Pro Glu He Glu Gly Glu He Lys Arg Asp Phe Met Ala 
260 265 270 

Ala Leu Glu Ala Glu Pro Tyr Asp Asp He Val Gly Glu Thr Val Glu 
275 280 . 285 
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Lys Thr Glu Phe lie Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn 
290 295 300 

sex Glu ser Lys Lys Lys Pro Cya Leu Asp Thr Ser Gin Val Glu Gly 
305 310 315 320 

He Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met 
325 330 335 

Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg 
340 345 350 

val Asp Tyr Pro Asp Tyr Gin Ser Ser Gin Asn Trp Pro Glu Asp Ala 
355 360 365 

Ser Phe Cys Phe Gin Pro Gin Gin Val Leu Asp Thr Asp Gin Ala Glu 
370 375 380 

Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val 
385 390 395 400 

Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr. Glu Arg Asp Asn Pro 
405 410 415 

Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr 
420 425 430 

Ala Val Val Ser Gin Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys 
435 440 445 

ser Glu Ser Cys Val Ser Pro Glu Val Thr He Glu Thr Leu Gin Pro 
450 455 460 

Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gin 
465 470 475 480 

Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu Gin Thr Thr Asp Val 
485 490 .495 

Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala 
500 505 510 

Ala Leu Ala Lys Asp He Glu Glu He Thr Lys Pro Asp Val He Leu 
515 520 525 

Ala Asn Val Thr Gin Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gin 
530 535 540 

ASD Met Glu Leu Leu Thr Gly Thr Glu Ala Ala His Ala Asn Asn He 
545 550 555 560 

He Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr Lys Asp Val Ala Pro 
. 565 570 575 

Pro Met Glu Glu Glu He Val Pro Gly Asn Asp Thr Thr Ser Pro Lys 
580 585 590 

Glu Thr Glu Thr Thr Leu Pro lie Lys Met Asp Leu Ala Pro Pro Glu 
595 600 605 

ASP Val Leu Leu Thr Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met 
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610 615 620 

Val Ser Leu Ser Glu He Qlu Glu Ala Leu Ala Lys Asn Asp Val Arg 
625 630 635 640 

Ser Ala Glu He Pro Val Ala Gin Glu Thr Val Val Ser Glu Thr Glu 
645 650 655 

Val Val Leu Ala Thr Glu Val Val Leu Pro Ser Asp Pro He Thr Thr 
660 665 670 

Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val 
675 680 685 

Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met Thr Leu Gly Lys Glu 
690 695 700 

Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser 
705 710 715 720 

Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys Asp Val Val He Leu 
725 730 735 

Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val Thr Pro Leu Ser Glu 
740 745 750 

Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu 
755 760 765 

Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Gly Thr Glu Leu He 
770 775 780 

Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu 
785 790 795 800 

Thr Lys Val Ala Tlir Val Pro He Lys Asp Lys Gly 
805 810 



<210> 5 
<211> 2439 
<212> DKFA 

<213> Artificial Sequence 
<220> 

<221> CDS ^ 
<222> (1) . . (2436) 

<220> 

<223> Description of Artificial Sequence: 
EYFP-DEAD-MAPKDM construct 

<400> 5 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

gtc gag ctg gac' ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 
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9^9 ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc ate 144 

Glu Gly Qlu Giy Asp Ala Thr Tyr Gly hys Leu Thr Leu Lys Phe He 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc -etc. gtg acc acc 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



ttc ggc tac ggc ctg cag tgc ttc gcc cgc tac ccc gac cac atg aag 
Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 



aac tac aac age cac aac gtc tat ate atg gcc gac aag cag aag aac 
Ash Tyr Asii Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 



ccc gtg ctg ctg ccc gac aac cac tac ctg age tac cag tec gcc ctg 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 



gtg acc gcc gcc ggg atc act etc ggc atg gad gag ctg tac aag ccc 
Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Pro 
225 230 235 240 



240 



cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 2 88 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 . 95 



336 



gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 



480 



ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac grgc age 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gee gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 1B5 190 



624 



age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 



720 



aga gac gaa gcc gac age gcc gac etc agt ett gtg gat gcg ttg aca 768 
Arg Asp Glu Ala Asp Ser Ala Asp Leu Ser Leu Val Asp Ala Leu Thr 
245 250 255 

gaa cca cet. cca gaa att gag gga gaa ata . aag - ega gac ttc atg get 816 
Glu Pro Pro Pro Glu He Glu Gly Glu He Lys Arg Asp Phe Met Ala 
260 265 270 

gcg ctg gag gca gag ccc tat gat gac ate gtg gga gaa act gtg gag 864 
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Ala Leu Glu Ala Glu Pro Tyr Asp Asp He Val Qly Glu Thr Val Glu 
275 280 285 

aaa act gag ttt att cct etc ctg gat ggt gat gag aaa acc ggg aac 912 
Lys Thr Glu Phe He Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn 
290 295 300 

tea gag tec aaa aag aaa ccc tgc tta gac act age cag gtt gaa ggt 960 
Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gin Val Glu Gly 
305 310 315 320 

ate cca tct tct aaa cea aca etc eta gee aat ggt gat cat gga atg 1008 
He Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met 
325 330 335 

gag ggg aat aac act gca ggg tct cea act gac ttc ctt gaa gag aga 1056 
Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg 
340 345 350 

gtg gac tat ccg gat tat cag age age cag aac tgg cca gaa gat gca 1104 
Val Asp Tyr Pro Asp Tyr Gin Ser Ser Gin Asn Trp Pro Glu Asp Ala 
355 360 365 

age ttt tgt ttc cag cct cag caa gtg tta gat act gac cag get gag 1152 
Ser Phe Cys Phe Gin Pro Gin Gin Val Leu Asp Thr Asp Gin Ala Glu 
370 375 380 

ccc ttt aac gag cac cgt gat gat ggt ttg gca gat ctg etc ttt gtc 1200 
Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val 
385 390 395 400 

tec agt gga ccc acg aac get tct gca ttt aca gag cga gac aat cct 1248 

Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro 
405 410 415 

y 

tea gaa gac agt tac ggt atg ctt ccc tgt gac tea ttt get tec acg /1296 

Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr 
420 425 430 

get gtt gta tct cag gag tgg tct gtg gga gee cca aac tct cca tgt 1344 
Ala Val Val Ser Gin Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys 
435 440 445 

tea gag tec tgt gtc tec cca gag gtt act ata gaa acc eta cag cca 1392 
Ser Glu Ser Cys Val Ser Pro Glu Val Thr He Glu Thr Leu Gin Pro 
450 455 460 

gca aca gag etc tec aag gca gca gaa gtg gaa tea gtg aaa gag cag 1440 
Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gin 
465 470 475 480 

Ctg cca get aaa gca ttg gaa acg atg gca gag cag acc act gat gtg 1488 
Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu Gin Thr Thr Asp Val 
485 490 495 

gtg cac tct cca tec aca gac aca aca cca ggc cca gac aca gag gca 1536 
Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala 
500 505 510 

gca ctg get aaa gac ata gaa gag ate ace aag cca gat gtg ata ttg 1584 
Ala Leu Ala Lys Asp He Glu Glu He Thr Lys Pro Asp Val He Leu 
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515 520 525 

gca aat gtc acg cag cca tct act gaa teg gat atg ttc ctg gcc cag 1632 
Ala Asn Val Thr Gin Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gin 
530 535 540 

gac atg gaa eta etc aca gga aca gag gca gcc cac get aae aat ate 1680 
Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala His Ala Asn Asn lie 
545 550 555 S60 

ata ttg cct aca gaa cca gac gaa tct tea acc aag gat gta gca cca 172 B 
lie Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr Lys Asp Val Ala Pro 
565 570 575 

cct atg gaa gaa gaa att gtc cca ggc aat gat acg aca tec cec aaa 1776 
Pro Met Glu Glu Glu He Val Pro Gly Asn Asp Thr Thr Ser Pro Lys 
580 585 590 

gaa aca gag aca aca ctt cca ata aaa atg gac ttg gca cca cct gag 1824 
Glu Thr Glu Thr Thr Leu Pro He Lys Met Asp Leu Ala Pro Pro Glu 
595 600 605 

gat gtg tta ctt acc aaa gaa aca gaa eta gcc cca gcc aag ggc atg 1872 
Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met 
610 615 620 

gtt tea etc tea gaa ata gaa gag get ctg gca aag aat gat gtt egc 1920 
Val Ser Leu Ser Glu He Glu Glu Ala Leu Ala Lys Asn Asp Val Arg 
625 630 635 640 

tct gca gaa ata cct gtg get cag gag aca gtg gtc tea gaa aca gag 196 8 
Ser Ala Glu He Pro Val Ala Gin Glu Thr Val Val Ser Glu Thr Glu 
645 650 655 



gtg gtc ctg gca aca gaa gtg gta ctg cec tea gat ccc ata aca aca 
Val Val Leu Ala Thr Glu Val Val Leu Pro Ser Asp Pro He Thr Thr 
660 665 670 



2016 



ttg aca aag gat gtg aca etc cec tta gaa gca gag aga ccg ttg gtg 2 0 64 
Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val 
675 680 685 

acg gac atg act cca tct ctg gaa aca gaa atg acc eta ggc aaa gag 2112 
Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met Thr Leu Gly Lys Glu 
690 695 700 

aca get cca cec aca gaa aca aat ttg ggc atg gcc aaa gac atg tct 2160 
Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser 
705 710 715 720 

cca etc cca gaa tea gaa gtg act ctg ggc aag gac gtg gtt ata ctt 22 08 
Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys Asp Val Val He Leu 
725 730 735 

cca gaa aca aag gtg get gag ttt aae aat gtg act cca ctt tea gaa 2256 
Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val Thr Pro Leu Ser Glu 
740 745 750 

gaa gag gta acc tea gtc aag gac atg tct ccg tct gca gaa aca gag 2304 
Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu 
755 760 765 
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get ccc ctg get aag aat get gat ctg cac tea gga aca gag etg att 2352 
Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Gly Thr Glu Leu lie 
770 775 780 

gtg gac aac age atg get cea gee tec gat ett gca etg ecc ttg gaa 2400 
Val ASP Asn ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu 
785 790 795 800 

aea aaa gta gea aea gtt eea att aaa gac aaa gga tga 243 9 

Thr Lys Val Ala Thr Val Pro He Lys Asp Lys Gly 
805 810 



<210> 6 
<211> 812 
<212> PRT 

<213> Artifieial Sequenee 
<220> 

<223> Deseription of Artificial Sequence: 
EYFP-DEAD-MAPKDM construct 

<400> 6 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 . 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
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195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Qlu Phe 
210 215 220 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Pro 
225 230 235 240 

Arg Asp Glu Ala Asp Ser Ala Asp Leu Ser Leu Val Asp Ala Leu Thr 
245 250 255 

Glu Pro Pro Pro Glu lie Glu Gly Glu lie Lys Arg Asp Phe Met Ala 
260 265 270 

Ala Leu Glu Ala Glu Pro Tyr Asp Asp lie Val Gly Glu Thr Val Glu 
275 280 285 

Lys Thr Glu Phe lie Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn 
290 295 300 

Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gin Val Glu Gly 
305 310 315 320 

lie Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met 
325 330 335 

Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg 
340 345 350 

Val Asp Tyr Pro Asp Tyr Gin Ser Ser Gin Asn Trp Pro Glu Asp Ala 
355 360 365 

Ser Phe Cys Phe Gin Pro Gin Gin Val Leu Asp Thr Asp Gin Ala Glu 
370 375 380 

Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val 
385 390 395 400 

Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro 
405 410 415 

Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr 
420 425 430 

Ala Val Val Ser Gin Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys 
435 440 445 

Ser Glu Ser Cys Val Ser Pro Glu Val Thr lie Glu Thr Leu Gin Pro 
450 455 460 

Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gin 
465 470 475 480 

Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu Gin Thr Thr Asp Val 
485 490 495 



Val His Ser Pro Ser Thr Asp Thr 
500 

Ala Leu Ala Lys Asp He Glu Glu 

515 520 



Thr Pro Gly Pro Asp Thr Glu Ala 
505 510 

He Thr Lys Pro Asp Val He Leu 
525 



15 



BNSDOCID: <W0 ^0050B72A2_I_> 



wo 00/50872 



PCT/USOO/04794 



Ala Asn Val Thr Gin Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gin 
530 535 540 

Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala His Ala Asn Asn He 
545 550 555 560 

He Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr Lys Asp Val Ala Pro 
565 570 575 

Pro Met Glu Glu Glu He Val Pro Gly Asn Asp Thr Thr Ser Pro Lys 
580 585 590 

Glu Thr Glu Thr Thr Leu Pro He Lys Met Asp Leu Ala Pro Pro Glu 
595 600 605 

Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met 
610 615 620 

Val Ser Leu Ser Glu He Glu Glu Ala Leu Ala Lys Asn Asp Val Arg 
625 630 635 640 

Ser Ala Glu He Pro Val Ala Gin Glu Thr Val Val Ser Glu Thr Glu 
645 650 655 

Val Val Leu Ala Thr Glu Val Val Leu Pro Ser Asp Pro He Thr Thr 
660 665 670 

Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val 
675 680 6B5 

Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met Thr Leu Gly Lys Glu 
690 695 700 

Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser 
705 710 715 720 

Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys Asp Val Val He Leu 
725 730 735 

Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val Thr Pro Leu Ser Glu 
740 745 750 

Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu 
755 760 765 

Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Qly Thr Glu Leu He 
770 775 780 

Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu 
785 790 795 800 

Thr Lys Val Ala Thr Val Pro He Lys Asp Lys Gly 
805 810 



<210> 7 
<211> 864 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<221> CDS 

<222> (1) . . (861) 

<220> 

<223> Description of Artificial Sequence: F25-MEK1 
construct 

<400> 7 

atg get age aaa gga gaa gaa etc ttc act gga gtt gtc cca att ctt 4 8 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc act act ggc aaa ctg cct gtt cca tgg cca aca eta gtc act act 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg tgc tat ggt gtt caa tgc ttt tea aga tac ccg gat cat atg aaa 240 
Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
6S 70 75 80 



egg cat gac ttt ttc aag agt gcc atg cec gaa ggt tat gta cag gaa 
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 



85 90 95 



288 



agg acc ate ttc ttc aaa gat gac ggc aac tac aag aca cgt get gaa 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtc aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tat aac tea cac aat gta tac ate atg gca gac aaa caa aag aat 4 80 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

gga ate aaa gtg aac ttc aag acc cge cac aac att gaa gat gga age 52 8 
Gly lie Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtt caa eta gca gac cat tat caa caa aat act cca att ggc gat ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

cct gtc ctt tta cca gac aac cat tac ctg tec aca caa tct gcc ctt 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 
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teg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gta aca get get ggg att aca cat ggc atg gat gaa ctg tac aac acc 720 
Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Asn Thr 
225 230 235 240 

ggt atg ccc aag aag aag ccg acg ccc ate cag ctg aac ceg gee cec 768 
Gly Met Pro Lys Lys Lys Pro Thr. Pro lie Gin Leu Asn Pro Ala Pro 
245 250 255 

gac ggc tet gea gtt aac ggg acc age tct gcg gag acc aac ttg gag 816 
Asp Gly Ser Ala Val Asn Gly Thr Ser Ser Ala Glu Thr Asn Leu Glu 
260 265 270 

gee ttg cag aag aag ctg gag gag eta gag ctt gat gag cag cag tga 664 
Ala Leu Gin Lys Lys Leu Glu Glu Leu Glu Leu Asp Glu Gin Gin 
275 280 285 



<210> 8 
<211> 287 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: F25-MEK1 
construct 

<400> 8 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 - 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 ^ 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn Xle Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 
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Gly lie Lys Val Asn 
165 

Val Gin Leu Ala Asp 
180 

Pro Val Leu Leu Pro 
195 

Ser Lys Asp Pro Asn 
210 

Val Thr Ala Ala Gly 
225 

Gly Met Pro Lys Lys 
245 

Asp Gly Ser Ala Val 
260 

Ala Leu Gin Lys Lys 
275 



Phe Lys Thr Arg His Asn 
170 

His Tyr Gin Gin Asn Thr 
185 

Asp Asn His Tyr Leu Ser 
200 

Glu Lys Arg Asp His Met 
215 

lie Thr His Gly Met Asp 
230 235 

Lys Pro Thr Pro lie Gin 
250 

Asn Gly Thr Ser Ser Ala 
265 

Leu Glu Glu Leu Glu Leu 
260 



lie Glu Asp Gly Ser 
175 

Pro lie Gly Asp Gly 
i90 

Thr Gin Ser Ala Leu 
205 

Val Leu Leu Glu Phe 
220 

Glu Leu Tyr Asn Thr 
240 

Leu Asn Pro Ala Pro 
255 

Glu Thr Asn Leu Glu 
270 

Asp Glu Gin Gin 
285 



<210> 9 

<211> 876 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (873) 



<220> 

<223> Description of Artificial Sequence: F25-MEK2 
construct 



<400> 9 

atg get age aaa gga gaa gaa etc ttc act gga gtt gtc cca att ctt 4 8 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

15 10 15 

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

9^9 99t gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc ate 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc act act ggc aaa ctg cet gtt cca tgg cca aca eta gtc act act 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg tgc tat ggt gtt caa tgc ttt tea aga tac ccg gat cat atg aaa 240 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 ' 80 

egg cat gac ttt ttc aag agt gee atg cec gaa ggt tat gta cag gaa 2 88 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
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85 90 95 

agg acc ate ttc ttc aaa gat gac ggc aac tac aag aca cgt get gaa 336 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

gtc aag ttt gaa ggt gat ace ctt gtt aat aga ate gag tta aaa ggt 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tat aac tea cac aat gta tac ate atg gea gac aaa caa aag aat 4 80 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

gga ate aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga age 52 8 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtt caa eta gea gac cat tat caa caa aat act cca att ggc gat ggc 576 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 



ect gtc ctt tta cea gac aac cat tac ctg tec aca caa tct gee ctt 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

teg aaa gat cec aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gta aca get get ggg att aca cat ggc atg gat gaa ctg tac aac acc 

Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Thr 

225 230 235 240 

ggt atg ctg gee egg agg aag ccg gtg ctg ccg gcg etc acc ate aac 

Gly Met Leu Ala Arg Arg Lys Pro Val Leu Pro Ala Leu Thr He Asn 

245 250 255 



624 



672 



720 



76B 



ect acc ate gee gag ggc cca tec ect acc age gag ggc gee tee gag 816 
Pro Thr He Ala Glu Gly Pro Ser Pro Thr Ser Glu Gly Ala Ser Glu 
260 265 270 

gea aac ctg gtg gac ctg cag aag aag ctg gag gag ctg gaa ctt gac 8 64 
Ala Asn Leu Val Asp Leu Gin Lys Lys Leu Glu Glu Leu Glu Leu Asp 
275 280 285 

gag eag cag taa 876 
Glu Gin Gin 
290 



<210> 10 
<211> 291 
<212> PRT 

<213> Artificial Sequence 
<220> 
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.<223> Description of Artificial Sequence: F25-MEK2 
construct 

<400> 10 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
SO 55 60 

Leu Cys Tyr Gly Val Gin cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Lea Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Thr 
225 230 235 240 

Gly Met Leu Ala Arg Arg Lys Pro Val Leu Pro Ala Leu Thr He TVsn 
245 250 255 

Pro Thr He Ala Glu Gly Pro Ser Pro Thr Ser Glu Gly Ala Ser Glu 
260 265 270 

Ala Asn Leu Val Asp Leu Gin Lys Lys Leu Glu Glu Leu Glu Leu Asp 
275 280 285 

Glu Gin Gin 
290 
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<2i0> 11 
<211> 889 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<221> CDS 

<222> (1) . . (888) 

<220> 

<223> Description of Artificial Sequence: Caspase 
3 -DEVD- substrate construct 



<400> 11 

atg get age aaa gga gaa gaa etc ttc act gga gtt gtc cea att ctt 4 8 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

15 10 15 

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 



gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc ate 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

tgc act act ggc aaa ctg' cct gtt cea tgg cca aca eta gtc . act act 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg tgc tat ggt gtt caa tgc ttt tea aga tac ccg gat eat atg aaa 240 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

egg eat gae ttt ttc aag agt gee atg cec gaa ggt tat gta cag gaa 2 88 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

agg acc ate ttc ttc aaa gat gae ggc aac tac aag aca cgt get gaa 3 36 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

gtc aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt 384 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

att gae ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tat aac tea cac aat gta tac ate atg gca gae aaa caa aag aat 4 80 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

145 150 155 160 



gga ate aaa gtg aac ttc aag ace cgc cac aac att gaa gat gga age 52 8 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtt caa eta gca gae cat tat caa caa aat act cca att ggc gat ggc 576 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
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180 



185 190 



cct qtc ctt tta cca gac aac cat tac ctg tec aca caa tot gcc ctt 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

teg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 

ser Lys Asp Pro Asn Glu Lys Arg Asp His . Met Val Leu Leu Glu Phe 

210 215 220 

gta aca get get ggg att aca eat ggc atg gat gaa etg tac aac tec 

Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 

225 230 235 240 

qga aga agg aaa ega caa aag cga teg get gtt aaa tct gaa gga aag 

Gly Arg Arg Lys Arg Gin Lys Arg Ser Ala Val Lys Ser Glu Gly Lys 

245 250 255 



tct act gtc cac gaa ate ctg tgc aag etc age ttg gag ggt gtt cat 
Ser Thr Val His Glu He Leu Cys Lys Leu Ser Leu Glu Gly Val Hxs 
275 280 285 

tct aca ccc cca agt aec egg ate c 
Ser Thr Pro Pro Ser Thr Arg He - 
290 295 



<210> 12 
<211> 296 
<212> PRT 

<213> Artificial Sequence 
<220> / 

<223> Description of Artificial Sequence: Caspase 
3 -DEVD- substrate construct 

<400> 12 . X 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu CVS Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 . 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 



23 



624 



672 



720 



768 



aga aag tgt gac gaa gtt gat gga att gat gaa gta gca agt act atg 816 
Arq Lys Cys Asp Glu Val Asp Gly He Asp Glu Val Ala Ser Thr Met 
260 265 270 



864 



889 
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Val Lys Phe Qlu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 .125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp. Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 

Gly Arg Arg Lys Arg Gin Lys Arg Ser Ala Val Lys Ser Glu Gly Lys 
245 . 250 255 

Atq Lvs Cys Asp Glu Val Asp Gly He Asp Glu Val Ala Ser Thr Met 
260 265 270 

Ser Thr Val His Glu He Leu Cys Lys Leu Ser Leu Glu Gly Val His 
275 280 285 

Ser Thr Pro Pro Ser Thr Arg He 

290 295 / 



<210> 13 
<211> 846 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (846) 

<220> 

<223> Description of Artificial Sequence: Caspase 
6-VEID-substrate construct 

<400> 13 

atg get age aaa- gga gaa gaa etc ttc act gga gtt gtc cca att ctt 4 8 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc.tct gtc agt gga 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc ate 144 
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Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

tgc act act ggc aaa ctg cct gtt cca tgg cca aca eta gtc act act 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg tgc tat ggt gtt caa tgc ttt tea aga tac ccg gat cat atg aaa 240 
Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



egg cat gac ttt ttc aag agt gcc atg ecc gaa ggt tat gta cag gaa 
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



gtc aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 



gga ate aaa gtg aac ttc aag acc cgc cac aae att gaa gat gga age 
Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 



teg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gta aca get get ggg att aca eat ggc atg gat gaa ctg tac aac tec 

Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 

225 230 235 240 



gaa gga gta cac agt aca cca cca age gca 
Glu Gly Val His Ser Thr Pro Pro Ser Ala 



288 



agg acc ate ttc ttc aaa gat gac ggc aac tac aag aca cgt get gaa 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 



384 



att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 . 

aac tat aac tea cac aat gta tac ate atg gca gac aaa caa aag aat 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 



528 



gtt caa eta gca gac cat tat caa caa aat act cca att ggc gat ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Giy Asp Gly 
180 185 190 

cct gtc ctt tta cca gac aac cat tac ctg tec aca ea;a tct gcc ctt 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 



672 



720 



gga aga agg aaa cga caa aag ega teg aca aga ctt gtt gaa att gac 768 

Gly Arg Arg Lys Arg Gin Lys Arg Ser Thr Arg Leu Val Glu He Asp 

245 250 255 

aac agt act atg age aca gta cac gaa att tta tgt aaa tta age tta 816 

Asn Ser Thr Met Ser Thr Val His Glu He Leu Cys Lys Leu Ser Leu 
260 265 270 



846 
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275 



280 



<210> 14 
<211> 282 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase 
6 -VEID- substrate construct 

<400> 14 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 

Gly Arg Arg Lys Arg Gin Lys Arg Ser Thr Arg Leu Val Glu lie Asp 



245 



250 



255 
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Asn Ser Tlir. Met Ser Thr Val Hie Glu lie Leu Cys Lys Leu Ser Leu 
260 2SS 270 

Glu Gly Val His Ser Thr Pro Pro Ser Ala 
275^ . 280 



<210> 15 
<211> 876 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . - (876) 

<220> 

<223> Description of Artificial Sequence: Caspase 8-VETD 
construct 

<40P> 15 

atg get age aaa gga gaa gaa etc ttc act gga gtt gtc cca att ctt 4 8 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 



gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 - 30 

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc ate 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

tgc act act ggc aaa ctg cct gtt cca tgg cca aca eta gtc act act 

Cys Thr Thr Gly Lys Leu Pro Val Pro- Trp Pro Thr Leu Val Thr Thr 

50 55 60 

ctg tgc tat ggt gtt caa tgc ttt tea aga tac ccg gat cat atg aaa 

Leu Cys Tyr Gly Val Gin Cys Phe Ser TVrg Tyr Pro Asp His Met Lys 

65 70 75 80 

egg eat gac ttt ttc aag agt gee atg ccc gaa ggt tat gta cag gaa 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 



96 



144 



192 



240 



288 



agg acc ate ttc ttc aaa gat gac ggc aac tac aag aca cgt get gaa 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtc aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr* 
130 135 140 

aac tat aac tea cac aat gta tac ate atg gea gac aaa caa aag aat 480 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 



27 



BNSEKXID: <WO_0O50B72A2_l_> 



wo 00/50872 



PCT/USOO/04794 



gga ate aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga age 528 

Gly lie Lys Val Asn Phe Lys Thr Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

gtt caa eta gca gac cat tat caa caa aat act cca att ggc gat ggc 576 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
IBO 1B5 190 

ect gtc ctt tta cca gac aac cat tac ctg tec aca caa tct gee ctt 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

teg aaa gat cce aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 



gga aga age aaa cga caa aag cga teg tat gaa aaa gga ata eca gtt 
Gly Arg Ser Lys Arg Gin Lys Arg Ser Tyr Glu Lys Gly lie Pro Val 
245 250 255 



gaa ate ctg tgc aag etc age ttg gag ggt gtt cat tct aca cee eca 
Glu lie Leu Cys Lys Leu Ser Leu Glu Gly Val His Ser Thr Pro Pro 
275 280 285 



agt gcc gga tec 
Ser Ala Gly Ser 
290 



<210> 16 
<211> 292' 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase 8-VETD 
construct 

<400> 16 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



624 



672 



gta aea get get ggg att aca cat ggc atg gat gaa etg tac aae tec 720 
Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 



768 



gaa aca gac age gaa gag caa get tat agt act atg tct act gtc cae 816 
Glu Thr Asp Ser Glu Glu Gin Ala Tyr Ser Thr Met Ser Thr Val His 
260 265 .270 



864 



876 
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Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
lis 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 

Gly Arg Ser Lys Arg Gin Lys Arg Ser Tyr Glu Lys Gly He Pro Val 
245 250 255 

Glu Thr Asp Ser. Glu Glu Gin Ala Tyr Ser Thr Met Ser Thr Val His 
260 265 270 

Glu He Leu Cys Lys Leu Ser Leu Glu Gly Val His Ser Thr Pro Pro 
275 280 285 

Ser Ala Gly Ser 
290 



<210> 17 
<211> 906 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (906) 

<220> 

<2 23> Description of Artificial Sequence: Gas 3 -multiple 
DEVD construct 

<400> 17 

atg get age aaa gga gaa gaa etc ttc act gga gtt gtc cca att ctt 4 8 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

1 . 5 . 10 15 
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gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96 
Val Glu Leu Asp Qly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc ate 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 4 0 45 

tgc act act ggc aaa ctg cct gtt cca tgg cca aca eta gtc act act 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg tgc tat ggt gtt caa tgc ttt tea aga tac ccg gat cat atg aaa 240 
Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

egg cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa 2 88 
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

agg acc ate ttc ttc aaa gat gac ggc aac tac aag aca cgt get gaa 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtc aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tae 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tat aac tea eac aat gta tac ate atg gca gac aaa caa aag aat 460 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

gga ate aaa gtg aae ttc aag acc cgc cac aac att gaa gat gga age 528 
Gly lie Lys Val Asn Phe Lys Thr Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

gtt caa eta gca gae cat tat caa caa aat act cca att ggc gat ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

cct gtc ctt tta cca gac aac cat tac ctg tec aca caa tet gee ctt 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

teg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gta aca get get ggg att aca cat ggc atg gat gaa ctg tac aac tec 72 0 
Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 

gga aga agg aaa cga caa aag cga teg gca ggt gac gaa gtt gat gca 768 
Gly Arg Arg Lys Arg Gin Lys Arg Ser Ala Gly Asp Glu Val Asp Ala 
245 250 255 
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ggt gac gaa gtt gat gca ggt gac gaa gtt gat gca ggt gac gaa gtt 816 
Gly Asp Glu Val Asp Ala Gly Asp Glu Val Asp Ala Gly Asp Glu Val 
260 265 270 

gac gca ggt agt act atg tct act gtc cac gaa ate ctg tgc aag etc 864 
Asp Ala Gly Ser Thr Met Ser Thr Val His Glu lie Leu Cys Lys Leu 
275 280 285 

age ttg gag ggt gtt cat tct aca ccc cca agt gcc gga tec 906 
Ser Leu Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser 
290 295 300 



<210> IB 
<21X> 302 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Gas 3 -multiple 
DEVD construct 

<400> 18 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 / 80 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu. 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
lis 120 125 

lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 IBS 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 
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Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thx Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 

Gly Arg Arg Lys Arg Gin Lys Arg Ser Ala Gly Asp Glu Val Asp Ala 
245 250 255 

Gly Asp Glu Val Asp Ala Gly Asp Glu Val Asp Ala Gly Asp Glu Val 
260 265 270 

Asp Ala Gly Ser Thr Met Ser Thr Val His Glu lie Leu Cys Lys Leu 
275 280 285 

Ser Leu Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser 
290 295 300 



<210> 19 
<211> 906 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (1) . . (885) . 
<220> 

<223> Description of Artificial Sequence: Caspase 
8 -multiple VETD construct 

<400> 19 

atg get age aaa gga gaa gaa etc ttc act gga gtt gtc cca att ctt 4 8 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

1 5 10 15 

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 



gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 



tgc act act ggc aaa ctg cct gtt cca tgg cca aca eta gtc act act 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg tgc tat ggt gtt caa tgc ttt tea aga tac ceg gat cat atg aaa 240 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

egg cat $ac ttt ttc aag agt gee atg ccc gaa ggt tat gta cag gaa 2 88 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

agg acc ate ttc ttc aaa gat gac ggc aac tac aag aca cgt get gaa 3 3 6- 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 
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gtc aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 X35 140 



aac tat aac tea cac aat gta tac ate atg gca gac aaa caa aag aat 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 



480 



624 



672 



720 



gga ate aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga age 528 
Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtt caa eta gca gac cat tat caa caa aat act cca att ggc gat ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 1B5 190 

cct gtc ctt tta cca gac aac cat tac ctg tec aca caa tct gee ctt 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

teg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 
Ser Lys Asp Pro' Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gta aca get get ggg att aca cat ggc atg gat gaa ctg tac aac tec 
Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 

gga aga agg aaa cga caa aag cga teg gca ggt gtt gaa aca gac gca 768 
Gly Arg Arg Lys Arg Gin Lys Arg Ser Ala Gly Val Glu Thr Asp Ala 
/ 245 250 255 

ggt gtt gaa aca gac gca ggt gtt gaa aca gac gca ggt gtt gaa aca 816 
Gly Val Glu Thr Asp Ala Gly Val Glu Thr Asp Ala Gly Val Glu Thr 
260 265 270 

gac gca ggt agt act atg tct act gtc cac gaa ate ctg tgc aag etc 864 
Asp Ala Gly Ser Thr Met Ser Thr Val His Glu He Leu Cys Lys Leu 
275 280 285 

age ttg gag ggt gtt cat tct acaeccccaa gtgceggatc e 906 
Ser Leu Glu Gly Val His Ser 
290 295 



<210> 20 
<211> 295 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase 
B -multiple VETD construct 

<400> 20 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
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15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25. 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Tiir Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly lie Lys Val Asn Phe Lys Thr Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
IBO 185 190 

tro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195" 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 

Gly Arg Arg Lys Arg Gin Lys Arg Ser Ala Gly Val Glu Thr Asp Ala 
245 250 255 

Gly Val Glu Thr Asp Ala Gly Val Glu Thr Asp Ala Gly Val Glu Thr 
260 265 270 

Asp Ala Gly Ser Thr Met Ser Thr Val His Glu He Leu Cys Lys Leu 
275 280 285 

Ser Leu Glu Gly Val His Ser 
290 295 



<210> 21 
<211> 4833 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<221> CDS 

<222> (1) . . (4830) 

<220> 

<223> Description of Artificial Sequence: 
EYFP-DEVD-MAP4-EBFP construct 

<400> 21 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ttc ggc tac ggc ctg cag tgc ttc gcc cgc tac ccc gac cac atg aag 240 
Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 BO 

cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 2 88 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336 
Arg Thr <rie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 3 84 
Val Lys Phe Glu Gly" Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gee gac aag cag aag aac 460 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gcc gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age tac cag tec gcc ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 
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age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 



210 2X5 220 

gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag aag 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Lys 

225 230 235 240 



720 



gga gac gaa gtg gac gga atg gcc gac etc agt ett gtg gat gcg ttg 768 
Gly Asp Glu Val Asp Gly Met Ala Asp Leu Ser Leu Val Asp Ala Leu 
245 250 255 

aca gaa cca cct cca gaa att gag gga gaa ata aag cga gac ttc atg 816 
Thr Glu Pro Pro Pro Glu He Glu Gly Glu He Lys Arg Asp Phe Met 
260 265 270 

get gcg ctg gag gca gag ccc tat gat gac ate gtg gga gaa act gtg 664 
Ala Ala Leu Glu Ala Glu Pro Tyr Asp Asp He Val Gly Glu Thr Val 
275 280 285 

gag aaa act gag ttt att cct etc ctg gat ggt gat gag aaa acc ggg 912 
Glu Lys Thr Glu Phe He Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly 
290 295 300 

aac tea gag tec aaa aag aaa ccc tgc tta gac act age cag gtt gaa 960 
Asn Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gin Val Glu 
305 310 315 320 

ggt ate cca tct tct aaa cca aca etc eta gcc aat ggt gat cat gga 
Gly He Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly 
325 330 335 

atg gag ggg aat aac act gca ggg tct cca act gac ttc ett gaa gag 
Met Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu. 
^ 340 345/ 350 

aga gtg gac tat ecg gat tat cag age age eag aac tgg cca gaa gat 
Arg Val Asp Tyr Pro Asp Tyr Gin Ser Ser Gin Asn Trp Pro Glu Asp 
355 360 365 

gca age ttt tgt ttc cag ect eag caa gtg tta gat act gac cag get 1152 
Ala Ser Phe Cys Phe Gin Pro Gin Gin Val Leu Asp Thr Asp Gin Ala 
370 375 380 

gag ccc ttt aac gag cac cgt gat gat ggt ttg gca gat ctg etc ttt 1200 
Glu Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe 
385 390 395 400 

gtc tec agt gga ccc acg aac get tct gca ttt aca gag cga gac aat 
Val Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn 
405 410 415 

cct tea gaa gac agt tac ggt atg ett ccc tgt gac tea ttt get tec 1296 
Pro Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp Ser Phe Ala Ser 
420 425 430 

acg get gtt gta tct eag gag tgg tct gtg gga gcc cca aac tct cca 1344 
Thr Ala Val Val Ser Gin Glu Trp Ser Val Gly Ala Pro Asn Ser Pro 
435 440 445 



1008 



1056 



1104 



1248 
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tgt tea gag tec tgt gtc tec cca gag gtt act ata gaa acc eta cag 13 92 
Cya Ser Glu Ser Cys Val Ser Pro Glu Val Thr lie Glu Thr Leu Gin 
450 455 460 

cca gca aca gag etc tec aag gca gca gaa gtg gaa tea gtg aaa gag 1440 
Pro Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu Ser Val Lys Glu 
465 470 475 480 

cag ctg cca get aaa gca ttg gaa acg atg gca gag cag acc act gat 14 88 
Gin Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu Gin Thr Thr Asp 
485 490 495 

gtg gtg cac tct eea tec aca gac aca aca eca ggc cca gac aca gag 1536 
Val Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu 
500 505 SIO 

gca gca ctg get aaa gac ata gaa gag ate acc aag cca gat gtg ata 15 84 
Ala Ala Leu Ala Lys Asp lie Glu Glu He Thr Lys Pro Asp Val He 
515 520 525 

ttg gca aat gtc acg cag cca tct act gaa teg gat atg ttc ctg gcc 1632 
Leu Ala Asn Val Thr Gin Pro Ser Thr Glu Ser Asp Met Phe Leu Ala 
530 535 540 



cag gac atg gaa eta etc aca gga aca gag gca gcc cac get aac aat 
Gin Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala His Ala Asn Asn 
545 550 555 560 



cca cet atg gaa gaa gaa att gtc cca ggc aat gat acg aca tec ccc 
Pro Pro Met Glu Glu Glu He Val Pro Gly Asn Asp Thr Thr Ser Pro 
580 585 590 



gag gat gtg tta ctt acc aaa gaa ac^ gaa eta gcc cca gcc aag ggc 
Glu Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly 
610 615 620 



1680 



ate ata ttg cet aca gaa cca gac gaa tct tea acc aag gat gta gca 172 8 
He He Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr Lys Asp Val Ala 
565 570 575 



1776 



aaa gaa aca gag aca aca ctt cca ata aaa '"atg gac ttg gca cca cet 1824 
Lys Glu Thr Glu Thr Thr Leu Pro He Lys Met Asp Leu Ala Pro Pro 
595 600 605 



1872 



atg gtt tea etc tea gaa ata gaa gag get ctg gca aag aat gat gtt 1920 
Met Val Ser Leu Ser Glu He Glu Glu Ala Leu Ala Lys Asn Asp Val 
625 630 635 640 

cgc tct gca gaa ata cet gtg get cag gag aca gtg gtc tea gaa aca 1968 
Arg Ser Ala Glu He Pro Val Ala Gin Glu Thr Val Val Ser Glu Thr 
645 650 655 

gag gtg gtc ctg gca aca gaa gtg gta ctg eec tea gat ccc ata aca 2016 
Glu Val Val Leu Ala Thr Glu Val Val Leu Pro Ser Asp Pro He Thr 
660 665 670 

aca ttg aca aag gat gtg aca etc ccc tta gaa gca gag aga ccg ttg 2064 
Thr Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu 
675 680 685 

gtg acg gac atg act eea tct ctg gaa aca gaa atg acc eta ggc aaa 2112 
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Val Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met Thr Leu Gly Lys 
690 695 700 

gag aca get cca ccc aca gaa aca aat ttg ggc atg gcc aaa gac atg 2160 
Glu Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met Ala Lys Asp Met 
705 710 715 720 

tct cca etc cca gaa tea gaa gtg act ctg ggc aag gac gtg gtt ata 2208 
Ser Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys Asp Val Val lie 
725 730 735 

ctt cca gaa aca aag gtg get gag ttt aac aat gtg act cca ctt tea 2256 
Leu Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val Thr Pro Leu Ser 
740 745 750 

gaa gaa gag gta ace tea gtc aag gac atg tet ccg tct gca gaa aca 2304 
Glu Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro Ser Ala Glu Thr 
755 760 765 

gag get ccc ctg get aag aat get gat ctg cae tea gga aca gag ctg 2352 
Glu Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Gly Thr Glu Leu 
770 775 780 

att gtg gac aac age atg get cca gcc tec gat ctt gca ctg ccc ttg 24 00 
lie Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu 
785 790 795 800 

gaa aca aaa -gta gca aca gtt cca att aaa gac aaa gga act gta cag 2448 
Glu Thr Lys Val Ala Thr Val Pro lie Lya Asp Lys Gly Thr Val Gin 
805 810 815 

act gaa gaa aaa cea cgt gaa gac tec cag tta gca tct atg cag cae 24 96 
Thr Glu Glu Lys Pro Arg Glu Asp Ser Gin Leu Ala Ser Met Gin His 
820 625 830 

aag gga cag tea aca gta cct cct tgc acg get tea cca gaa cca gtc 2544 
Lys Gly Gin Ser Thr Val Pro Pro Cys Thr Ala Ser Pro Glu Pro Val 
B35 840 845 

aaa get gca gaa caa atg tct ace tta cca ata gat gca cct tct cca 2592 
Lys Ala Ala Glu Gin Met Ser Thr Leu Pro lie Asp Ala Pro Ser Pro 
850 ess 860 

tta gag aac tta gag cag aag gaa acg cct ggc age cag cct tct gag 2640 
Leu Glu Asn Leu Glu Gin Lys Glu Thr Pro Gly Ser Gin Pro Ser Glu 
865 iB70 875 880 

cct tgc tea gga gta tec egg caa gaa gaa gca aag get get gta ggt 2688 
Pro Cys Ser Gly Val Ser Arg Gin Glu Glu Ala Lys Ala Ala Val Gly 
885 890 895 

gtg act gga aat gac ate act ace ccg cea aac aag gag cea cca cca 2736 
Val Thr Gly Asn Asp lie Thr Thr Pro Pro Asn Lys Glu Pro Pro Pro 
900 905 910 

age cca gaa aag aaa gca aag cct ttg gee acc act caa cct gca aag 2 784 
Ser Pro Glu Lys Lys Ala Lys Pro Leu Ala Thr Thr Gin Pro Ala Lys 
915 920 925 

act tea aca teg aaa gcc aaa aca cag ccc act tct etc cct aag caa 2 832 
Thr Ser Thr Ser Lys Ala Lys Thr Gin Pro Thr Ser Leu Pro Lys Gin 
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930 935 940 

cca get ccc acc acc tct ggt ggg ttg aat aaa aaa ccc atg age etc 2880 

Pro Ala Pro Thr Thr Ser Gly Gly Leu Asn Lys Lys Pro Met Ser Leu 
945 950 955 960 



gee tea ggc tea gtg cca get gee cca cae aaa cge ect get get gcc 
Ala Ser Gly Ser Val Pro Ala Ala Pro His Lys Arg Pro Ala Ala Ala 
965 970 975 

act get act gcc agg ect tec acc eta cet gee aga gac gtg aag cca 
Thr Ala Thr Ala Arg Pro Ser Thr Leu Pro Ala Arg Asp Val Lys Pro 
9B0 985 990 

aag eea att aca gaa get aag gtt gcc gaa aag egg acc tct cea tec 
Lys Pro lie Thr Glu Ala Lys Val Ala Glu Lys Arg Thr Ser Pro Ser 
995 1000 1005 

aag cet tea tct gcc cea gee etc aaa cet gga ect aaa acc acc cca 
Lys Pro Ser Ser Ala Pro Ala Leu Lys Pro Gly Pro Lys Thr Thr Pro 
1010 1015 1020 

acc gtt tea aaa gcc aca tct ccc tea act ett gtt tec act gga cea 
Thr Val Ser Lys Ala Thr Ser Pro Ser Thr Leu Val Ser Thr Gly Pro 
1025 1030 1035 1040 

agt agt aga agt cca get aca act ctg cet aag -agg cca acc age ate 
Ser Ser Arg Ser Pro Ala Thr Thr Leu Pro Lys Arg Pro Thr Ser lie 
1045 1050 1055 



aag aga aac ace act ccc act ggg gca gea ccc cea gca -ggg atg act 

Lys Arg Asn Thr Thr Pro Thr Gly Ala Ala Pro Pro Ala Gly Met Thr 

1090 1095 1100 

tec act cga gtc aag ccc atg tct gca cet age cge tct tct ggg get 

Ser Thr Arg Val Lys Pro Met Ser Ala Pro Ser Arg Ser Ser Gly Ala 

1105 1110 - 1115 1120 

ett tct gtg gac aag aag ccc act tec act aag cet age tec tct get 

Leu Ser Val Asp Lys Lys Pro Thr Ser Thr Lys Pro Ser Ser Ser Ala 

1125 1130 1135 



agt gtt cge tec aag gtc ggc tct aca gaa aac ate aaa cae cag cet 
Ser Val Arg Ser Lys Val Gly Ser Thr Glu Asn He Lys His Gin Pro 
1155 1160 1165 



2928 



2976 



3024 



3072 



3120 



3168 



aag act gag ggg aaa cet get gat gtc aaa agg atg act get aag tct 3216 
Lys Thr Glu Gly Lys Pro Ala Asp Val Lys Arg Met Thr Ala Lys Ser 
1060 1065 1070 

gee tea get gac ttg agt cge tea aag acc acc tct gee agt tct gtg 3264 
Ala Ser Ala Asp Leu Ser Arg; Ser Lys Thr Thr Ser Ala Ser Ser Val 
1075 1080 1085 



3312 



3360 



3408 



ccc agg gtg age cge ctg gcc aca act gtt tct gcc ect gac ctg aag 3456 
Pro Arg Val Ser Arg Leu Ala Thr Thr Val Ser Ala Pro Asp Leu Lys 
1140 1145 1150 



3504 



gga gga ggc egg gee aaa gta gag aaa aaa aca gag gca get ace aca 3 552 
Gly Gly Gly Arg Ala Lys Val Glu Lys Lys Thr Glu Ala Ala Thr Thr 
1170 1175 1180 
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get ggg aag cct gaa cct aat gca gtc act aaa gca gcc ggc tec att 3600 
Ala Gly Lys Pro Glu Pro Asn Ala Val Thr Lys Ala Ala Gly Ser He 
1185 1190 1195 120O 

gcg agt gca cag aaa ccg cct get ggg aaa gtc cag ata gta tec aaa 3648 
Ala Ser Ala Gin Lys Pro Pro Ala Gly Lys Val Gin He Val Ser Lys 
1205 1210 1215 

aaa gtg age tac agt cat att caa tec aag tgt gtt tec aag gac aat 3696 
Lys Val Ser Tyr Ser His He Gin Ser Lys Cys Val Ser Lys Asp Aen 
1220 1225 1230 

att aag cat gtc cct gga tgt ggc aat gtt cag att cag aac aag aaa 3744 
He Lys His Val Pro Gly Cys Gly Asn Val Gin He Gin Asn Lys Lys 
1235 1240 1245 

gtg gac ata tec aag gtc tec tec aag tgt ggg tec aaa get aat ate 3 792 
Val Asp He Ser Lys Val Ser Ser Lys Cys Gly Ser Lys Ala Asn He 
1250 1255 1260 

aag cac aag cct ggt gga gga gat gtc aag att gaa agt cag aag ttg 3 640 
Lys His Lys Pro Gly Gly Gly Asp Val Lys He Glu Ser Gin Lys Leu 
1265 1270 1275 1280 

aac ttc aag gag aag gcc caa gcc aaa gtg gga tec ctt gat aac gtt 3 88 8 
Asn Phe Lys Glu Lys Ala Gin Ala Lys Val Gly Ser Leu Asp Asn Val 
1285 1290 129a 

ggc cac ttt cct gca gga ggt gee gtg aag act gag ggc ggt ggc agt 3936 
Gly His Phe Pro Ala Gly Gly Ala Val Lys Thr Glu Gly Gly Gly Ser 
1300 1305 1310 

gag gcc ctt ccg tgt cca ggc cec cec get ggg gag gag cca gtc ate 3984 
Glu Ala Leu Pro Cys Pro Gly Pro Pro Ala Gly Glu Glu Pro Val He 
1315 13^20 1325 

cct gag get gcg cct gac egt ggc gcc cct act tea gcc agt ggc etc 4032 
Pro Glu Ala Ala Pro Asp Arg Gly Ala Pro Thr Ser Ala Ser Gly Leu 
1330 1335 1340 

agt ggc cac acc ace ctg tea ggg ggt ggt gac caa agg gag ecc cag 4 080 
Ser Gly His Thr Thr Leu Ser Gly Gly Gly Asp Gin Arg Glu Pro Gin 
1345 1350 1355 1360 

acc ttg gac age cag ate cag gag aea age ate atg gtg age aag ggc 412 8 
Thr Leu Asp Ser Gin He Gin Glu Thr Ser He Met Val Ser Lys Gly 
1365 1370 1375 

gag gag ctg ttc acc ggg gtg gtg ecc ate ctg gtc gag ctg gac ggc 4176 
Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu Asp Gly 
1380 1385 1390 

gac gta aac ggc cac aag ttc age gtg tec ggc gag ggc gag ggc gat 4224 
Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp 
1395 1400 1405 

gee acc tac ggc aag ctg ace ctg aag ttc ate tge acc acc ggc aag 4272 
Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys 
1410 1415 1420 
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ctg ccc gtg ccc tgg ccc acc etc gtg acc acc ctg acc cac ggc gtg 4320 

Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr His Gly Val 
1425 1430 1435 1440 

cag tgc ttc age cgc tac ccc gac cac atg aag cag cac gac ttc ttc 4368 

Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe 
1445 1450 1455 



aag tec gcc atg ccc gaa ggc tac gtc cag gag cgc acc ate ttc ttc 
Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe 
1460 1465 1470 



gag aag cgc gat cac atg gtc ctg ctg gag ttc gtg acc gcc gcc ggg 
Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly 
1585 1590 1595 1600 



4416 



aag gac gac ggc aac tac aag acc cgc gcc gag gtg aag ttc gag ggc 4464 
Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly 
1475 1480 1485 

gac acc ctg gtg aac cgc ate gag ctg aag ggc ate gac ttc aag gag 4512 
Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly He Asp Phe Lys Glu 
1490 1495 1500 

gac ggc aac ate ctg ggg cac aag ctg gag tac aac ttc aac age cac 4560 
Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Phe Asn Ser His 
1505 1510 1515 1520 

aac gtc tat ate atg gcc gac aag cag aag aac ggc ate aag gtg aac 4608 
Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn 
1525 1530 1535 

ttc aag ate cgc cac aac ate gag gac ggc age gtg cag etc gcc gac 4 65 6 
Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp 
1540 1545 1550 

cac tac cag cag aac acc ccc ate ggc gac ggc ccc gtg ctg ctg ccc 4704 
His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro Val Leu Leu Pro 
1555 1560 1565 

gac aac cac tac ctg age acc cag tec gcc ctg age aaa gac ccc aac 4752 
Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn 
1570 1575 1580 



4800 



ate act etc ggc atg gac gag ctg tac aag tag 4833 
He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
1605 1610 



<:210> 22 
<211> 1610 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
EYFP-DEVD-MAP4-EBFP construct 

<400> 22 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 
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Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp Hie Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin- Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser- Tyr Gin Ser Ala Leu 
195 ^00 ^ 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr- Lys Lys 
225 230 235 240 

Gly Asp Glu Val Asp Gly Met Ala Asp Leu Ser Leu Val Asp Ala Leu 
245 250 255 

Thr Glu Pro Pro Pro Glu He Glu Gly Glu He Lys Arg Asp Phe Met 
260 265 270 

Ala Ala Leu Glu Ala Glu Pro Tyr Asp Asp He Val Gly Glu Thr Val 
275 280 285 

Glu Lys Thr Glu Phe He Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly 
290 295 300 

Asn Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gin Val Glu 
305 310 315 320 

Gly He Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly 
325 330 335 
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Met Glu Gly Asn Aen Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu 
340 345 350 

Arg Val Asp Tyr Pro Asp Tyr Gin Ser Ser Gin Asn Trp Pro Glu Asp 
355 360 365 

Ala Ser Phe Cys Phe Gin Pro Gin Gin Val Leu Asp Thr Asp Gin Ala 
370 375 380 

Glu Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe 
385 390 395 400 

Val Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn 
405 410 415 

Pro Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp Ser Phe Ala Ser 
420 425 430 

Thr Ala Val Val Ser Gin Glu Trp Ser Val Gly Ala Pro Asn Ser Pro 
435 440 445 

Cys Ser Glu Ser Cys Val Ser Pro Glu Val Thr lie Glu Thr Leu Gin 
450 455 460 

Pro Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu Ser Val Lys Glu 
465 470 475 480 

Gin Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu Gin Thr Thr Asp 
485 490 495 

Val Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu 
500 505 510 

Ala Ala Leu Ala Lys Asp lie Glu Glu lie Thr Lys Pro Asp Val lie 
515 520 525 

y 

Leu Ala Asn Val Thr Gin Pro Ser^Thr Glu Ser Asp Met Phe Leu Ala 
530 535 540 

Gin Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala His Ala Asn Asn 
545 550 555 560 

He He Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr Lys Asp Val Ala 
565 570 575 

Pro Pro Met Glu Glu Glu He Val Pro Gly Asn Asp Thr Thr Ser Pro 
580 585 590 

Lys Glu Thr Glu Thr Thr Leu Pro He Lys Met Asp Leu Ala Pro Pro 
595 600 605 

Glu Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly 
610 615 620 

Met Val Ser Leu Ser Glu He Glu Glu Ala Leu Ala Lys Asn Asp Val 
625 630 635 640 

Arg Ser Ala Glu He Pro Val Ala Gin Glu Thr Val Val Ser Glu Thr* 
645 650 655 

Glu Val Val Leu Ala Thr Glu Val Val Leu Pro Ser Asp Pro He Thr- 
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660 665 670 

Thr Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu 
675 680 685 

Val Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met Thr Leu Gly Lys 
690 695 700 

Glu Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met Ala Lys Asp Met 
705 710 715 720 

Ser Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys Asp Val Val lie 
725 730 735 

Leu Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val Thr Pro Leu Ser 
740 745 750 

Glu Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro Ser Ala Glu Thr 
755 760 765 

Glu Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Gly Thr Glu Leu 
770 775 780 

He Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu 
785 790 795 800 

Glu Thr Lys Val Ala Thr Val Pro He Lys Asp Lys Gly Thr Val Gin 
805 810 815 

Thr Glu Glu Lys Pro Arg Glu Asp Ser Gin Leu Ala Ser Met Gin His 
820 825 830 

Lys Gly Gin Ser Thr Val Pro Pro Cys Thr Ala Ser Pro Glu Pro Val 
835 840 845 

Lys Ala Ala Glu Gin Met Ser Thr Leu Pro He Asp Ala Pro Ser Pro 
650 855 860 

Leu Glu Asn Leu Glu Gin Lys Glu Thr Pro Gly Ser Gin Pro Ser Glu 
865 870 875 880 

Pro Cys Ser Gly Val Ser Arg Gin Glu Glu Ala Lys Ala Ala Val Gly 
885 890 895 

Val Thr Gly Asn Asp He Thr Thr Pro Pro Asn Lys Glu Pro Pro Pro 
900 905 910 

Ser Pro Glu Lys Lys Ala Lys Pro Leu Ala Thr Thr Gin Pro Ala Lys 
915 920 925 

Thr Ser Thr Ser Lys Ala Lys Thr Gin Pro Thr Ser Leu Pro Lys Gin 
930 935 940 

Pro Ala Pro Thr Thr Ser Gly Gly Leu Asn Lys Lys Pro Met Ser Leu 
945 950 955 960 

Ala Ser Gly Ser Val Pro Ala Ala Pro His Lys Arg Pro Ala Ala Ala 
965 970 975 

Thr Ala Thr Ala Arg Pro Ser Thr Leu Pro Ala Arg Asp Val Lys Pro 
980 985 990 
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Lys Pro lie Thr Glu AJa Lye Val Ala Glu Lys Arg Thr Ser Pro Ser 
995 1000 1005 

Lys Pro Ser Ser Ala Pro Ala Leu Lys Pro Gly Pro Lys Thr Thr Pro 
1010 1015 1020 

Thr Val Ser Lys Ala Thr Ser Pro Ser Thr Leu Val Ser Thr Gly Pro 
1025 1030 1035 1040 

Ser Ser Arg Ser Pro Ala Thr Thr Leu Pro Lys Arg Pro Thr Ser lie 
1045 1050 1055 

Lys Thr Glu Gly Lys Pro Ala Asp Val Lys Arg Met Thr Ala Lys Ser 
1060 1065 1070 

Ala Ser Ala Asp Leu Ser Arg Ser Lys Thr Thr Ser Ala Ser Ser Val 
1075 1080 1085 

Lys Arg Asn Thr Thr Pro Thr Gly Ala Ala Pro Pro Ala Gly Met Thr 
1090 1095 1100 

Ser Thr Arg Val Lys Pro Met Ser Ala Pro Ser Arg Ser Ser Gly Ala 
1105 1110 1115 1120 

Leu Ser Val Asp Lys Lys Pro Thr Ser Thr Lys Pro Ser Ser Ser Ala 
1125 1130 1135 

Pro Arg Val Ser Arg Leu Ala Thr Thr Val Ser Ala Pro Asp Leu Lys 
1140 1145 1150 

Ser Val Arg Ser Lys Val Gly Ser Thr Glu Asn lie Lys His Gin Pro 
1155 1160 1165 

Gly Gly Gly Arg Ala Lys Val Glu Lys Lys Thr Glu Ala Ala Thr Thr 
1170 1175 1180 / 

Ala Gly Lys Pro Glu Pro Asn Ala Val Thr Lys Ala Ala Gly Ser lie 
1185 1190 1195 1200 

Ala Ser Ala Gin Lys Pro Pro Ala Gly Lys Val Gin He Val Ser Lys 
1205 1210 1215 

Lys Val Ser Tyr Ser His He Gin Ser Lys Cys Val Ser Lys Asp Asn 
1220 1225 1230 

He Lys His Val Pro Gly Cys Gly Asn Val Gin He Gin Asn Lys Lys 
1235 1240 1245 

Val Asp He Ser Lys Val Ser Ser Lys Cys Gly Ser Lys Ala Asn He 
1250 1255 1260 

Lys His Lys Pro Gly Gly Gly Asp Val Lys He Glu Ser Gin Lys Leu 
1265 1270 1275 1280 

Asn Phe Lys Glu Lys Ala Gin Ala Lys Val Gly Ser Leu Asp Asn Val 
1285 1290 1295 

Gly His Phe Pro Ala Gly Gly Ala Val Lys Thr Glu Gly Gly Gly Ser 
1300 1305 1310 
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Glu Ala Leu Pro Cys Pro Gly Pro Pro Ala Gly Glu Glu Pro Val lie 
1315 1320 1325 

Pro Glu Ala Ala Pro Asp Arg Gly Ala Pro Thr Ser Ala Ser Gly Leu 
1330 1335 1340 

Ser Gly His Thr Thr Leu Ser Gly Gly Gly Asp Gin Arg Glu Pro Gin 
1345 1350 1355 1360 

Thr Leu Asp Ser Gin He Gin Glu Thr Ser He Met Val Ser Lys Gly 
1365 1370 1375 

Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu Asp Gly 
1380 13B5 1390 

Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp 
1395 1400 1405 

Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys 
1410 1415 1420 

Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr His Gly Val 
1425 1430 1435 1440 

Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe 
1445 1450 1455 

Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe 
1460 1465 1470 

Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly 
1475 1480 1485 

Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp Phe Lys Glu 
1490 1495 1500 

Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Phe Asn Ser His 
1505 1510 .1515 1520 

Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn 
1525 1530 1535 

Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp 
1540 1545 1550 



His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro 
1555 1560 1565 

Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn 
1570 1575 1580 

Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly 
1585 1590 1595 1600 

He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
1605 1610 



<210> 23 
<211> 978 
<212> DNA 
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<220> 

<221> CDS 

<222> (1) . . (978) 

<220> 

<223> Description of Artificial Sequence: 

GPP -nucleolus -Caspase 8-annexin II construct 

<400> 23 

atg get age aaa gga gaa gaa etc ttc act gga gtt gtc cca att ctt 4 8 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 . 10 15 



gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc ate 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

tgc act act ggc aaa ctg cct gtt cca tgg cca aca eta gtc act act 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg tgc tat ggt gtt caa tgc ttt tea aga tac ccg gat cat atg aaa 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

65 70 75 80 

egg cat gac ttt ttc aag agt gee atg ccc gaa ggt tat gta cag gaa 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr' Val Gin Glu 

85 . ^0 95 

agg acc ate ttc ttc aaa gat gac ggc aac tac aag aca cgt get gaa 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 



96 



144 



192 



240 



288 



/ 336 



gtc aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 

lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 



432 



aac tat aac tea cac aat gta tac ate atg gca gac aaa caa aag aat 480 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

gga ate aaa gtg aac ttc aag acc ege cac aac att gaa gat gga age 528 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtt caa eta gca gac cat tat caa caa aat act cca att ggc gat ggc 576 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

cct gtc ctt tta cca gac aac cat tac ctg tec aca caa tct gee ctt 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
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195 200 205 

teg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672 
Ser Lys Asp Pro Asn Glu Lye Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gta aca get get ggg att aea cat ggc atg gat gaa ctg tac aac tec 720 
Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 • 230 235 240 

gga aga aaa cgt ata cgt act tac etc aag tec tgc agg egg atg aaa 7 68 
Gly Arg Lys Arg He Arg Thr Tyr Leu Lys Ser Cys Arg Arg Met Lys 
245 250 255 

aga agt ggt ttt gag atg tct cga cet att cct tec cac ctt act ega 816 
Arg Ser Gly Phe Glu Met Ser Arg Pro He Pro Ser His Leu Thr Arg 
260 265 270 

teg gca ggt gtt gaa aea gac gca ggt gtt gaa aca gac gca ggt gtt 864 
Ser Ala Gly Val Glu Thr Asp Ala Gly Val Glu Thr Asp Ala Gly Val 
275 280 285 

gaa aca gac gca ggt gtt gaa aca gac gca ggt agt act atg tct act 912 
Glu Thr Asp Ala Gly Val Glu Thr Asp Ala Gly Ser Thr Met Ser Thr 
290 295 300 

gtc cac gaa ate ctg tgc aag etc age ttg gag ggt gtt cat tct aca 960 
Val His Glu He Leu Cys Lys Leu Ser Leu Glu Gly Val His Ser Thr 
305 310 315 320 

ccc cea agt gee gga tec ^'^8 
Pro Pro Ser Ala Gly Ser 
325 



<210> 24 
<211> 326 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

GFP-nucleolus-Caspase 8-annexin II construct 

<400> 24 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
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Arg Thr He Phe Phe 
100 

Val Lys Phe Glu Gly 
115 

He Asp Phe Lys Glu 
130 

Asn Tyr Asn Ser His 
145 

Gly He Lys Val Asn 
165 

Val Gin Leu Ala Asp 
180 

Pro Val Leu Leu Pro 
195 

Ser Lys Asp Pro Asn 
210 

Val Thr Ala Ala Gly 
225 

Gly Arg Lys Arg He 
245 

Arg Ser Gly Phe Glu 
260 

Ser Ala Gly Val Glu 
275 

Glu Thr Asp Ala Gly 
290 

Val His Glu He Leu 
305 

Pro Pro Ser Ala Gly 
325 



90 

Lys Asp Asp Gly Asn Tyr 
105 

Asp Thr Leu Val Asn^Arg 
120 

Asp Gly Asn He Leu Gly 
135 

Asn Val Tyr He Met Ala 
150 155 

Phe Lys Thr Arg His Asn 
170 

His Tyr Gin Gin Asn Thr 
185 

Asp Asn His Tyr Leu Ser 
200 

Glu Lys Arg Asp His Met 
215 

He Thr His Gly Met Asp 
230 235 

Arg Thr Tyr Leu Lys Ser 
250 

Met Ser Arg Pro He Pro 
265 

Thr Asp Ala Gly Val Glu 
/ 280 

Val Glu Thr Asp Ala Gly 
295 

Cys Lys Leu Ser Leu Glu 
310 315 

Ser 



95 

Lys Thr Arg Ala Glu 
110 

He Glu Leu Lys Gly 
125 

His Lys Leu Glu Tyr 
140 

Asp Lys Gin Lys Asn 
160 

He Glu Asp Gly Ser 
175 

Pro He Gly Asp Gly 
190 

Thr Gin Ser Ala Leu 
205 

Val Leu Leu Glu Phe 
220 

Glu Leu Tyr Asn Ser 
240 

Cys Arg Arg Met Lys 
255 

Ser His Leu Thr Arg 
270 

Thr Asp Ala Gly Val / 
285 

Ser Thr Met Ser Thr 
300 

Gly Val His Ser Thr 
320 



<210> 25 
<211> 948 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (948) 

<220> 

<223> Description of Artificial Sequence: 

GFP-nucleolus-Caspase 3-annexin II construct 
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<400> 25 

atg get age aaa gga gaa gaa etc tte act gga gtt gte cca att ctt 4 8 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

1 5 10 15 

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gte agt gga 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggt gaa ggt gat gca aca tac gga aaa ctt ace ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 . 45 

tgc act act ggc aaa ctg cct gtt cca tgg cca aca eta gte act act 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg tgc tat ggt gtt caa tgc ttt tea aga tac ccg gat cat atg aaa 24 0 
Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

egg cat gac ttt ttc aag agt gee atg cec gaa ggt tat gta cag gaa 28 6 
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

agg acc ate ttc ttc aaa gat gac ggc aac tac aag aca cgt get gaa 33 6 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gte aag ttt gaa ggt gat ace ctt gtt aat aga ate gag tta aaa ggt 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn TVrg lie Glu Leu Lys Gly 
115 12 0 125 

att gac ttc aag gaa gat ggc aac att ctg gga eac aaa ttg gaa tac 432 
lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 / 

aac tat aac tea cac aat gta tac ate atg gca gac aaa caa aag aat: 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

gga ate aaa gtg aac ttc aag ace cge cac aac att gaa gat gga age 526 
Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtt caa eta gca gac cat tat caa caa aat act cca att ggc gat ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly 'Asp Gly 
180 185 190 

cet gte ctt tta cca gac aac cat tac ctg tec aca caa tct gee ctt 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

teg aaa gat ece aac gaa aag aga gac cac atg gte ctt ctt gag ttt 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gta aca get get ggg att aca cat ggc atg gat gaa ctg tac aac tec 720 
Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 
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gga aga aaa cgt ata cgt act tac etc aag tec tgc agg egg atg aaa 76B 
Gly Arg Lys Arg lie Arg Thr Tyr Leu Lya Ser Cys Arg Arg Met Lys 
245 250 255 

aga agt ggt ttt gag atg tct cga cct att act tec cac ctt act cga 816 
Arg Ser Gly Phe Glu Met Ser Arg Pro He Pro Ser His Leu Thr Arg 
260 265 270 



teg tat gaa aaa gga ata eca gtt gaa aca gac age gaa gag caa get 
Ser Tyr Glu Lye Gly He Pro Val Glu Thr Asp Ser Glu Glu Gin Ala 
275 280 285 



864 



tat agt act atg tct act gtc cac gaa ate ctg tgc aag etc age ttg 912 

Tyr Ser Thr Met Ser Thr Val His Glu He Leu Cys Lys Leu Ser Leu 
290 295 300 

gag ggt gtt cat tct aca ccc eca agt gee gga tec 94 8 

Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser 

305 310 315 



<210> 26 
<211> 316 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

GFP-nucleolus-Caspase 3-annexin II construct 

<400> 26 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 
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Gly lie Lys Val Asn Phe 
165 

Val Gin Leu Ala Asp His 
180 

Pro Val Leu Leu' Pro Asp 
195 

Ser Lys Asp Pro Asn Glu 
210 

Val Thr Ala Ala Gly lie 
225 230 

Gly Arg Lys Arg lie Arg 
245 

Arg Ser Gly Phe Glu Met 
260 

Ser Tyr Glu Lys Gly lie 
275 

Tyr Ser Thr Met Ser Thr 
290 

Glu Gly Val His Ser Thr 
305 310 



Lys Thr Arg His Asn 
170 

Tyr Gin Gin Asn Thr 
185 

Asn His Tyr Leu Ser 
200 

Lys Arg Asp His Met 
215 

Thr His Gly Met Asp 
235 

Thr Tyr Leu Lys Ser 
250 

Ser Arg Pro lie Pro 
265 

Pro Val Glu Thr Asp 
280 

Val His Glu lie Leu 
295 

Pro Pro Ser Ala Gly 
315 



lie Glu Asp Gly Ser 
175 

Pro lie Gly Asp Gly 
190 

Thr Gin Ser Ala Leu 
205 

Val Leu Leu Glu Phe 
220 

Glu Leu Tyr Asn Ser 
240 

Cys Arg Arg Met Lys 
255 

Ser His Leu Thr Arg 
270 

Ser Glu Glu Gin Ala 
285 

Cys Lys Leu Ser Leu 
300 

Ser 



<210> 27 
<211> 2088 
<212> DMA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (1041) 

<220> 

<223> Description of Artificial Sequence: 
NLS-Fred25-synaptohrevin construct 

<400> 27 

atg aga aga aaa cga caa aag get age aaa gga gaa gaa etc ttc act 4 8 

Met Arg Arg Lys Arg Gin Lys Ala Ser Lys Gly Glu Glu Leu Phe Thr 
1 5 10 15 

gga gtt gtc cca att ctt gtt gaa tta gat ggt gat gtt aac ggc cac 96 
Gly Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 
20 25 30 

aag ttc tct gtc agt gga gag ggt gaa ggt gat gca aca tac gga aaa 144 
Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 
35 40 45 

Ctt acc ctg aag ttc ate tgc act act ggc aaa ctg cct gtt cca tgg 192 
Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 
50 55 60 
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240 



cca aca eta gtc act act ctg tgc tat ggt gtt caa tgc ttt tea aga 
Pro Thr Leu Val Thr Thr Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg 
65 70 75 80 

tac ccg gat cat atg aaa egg cat gac ttt ttc aag agt gcc atg ccc 
Tvr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Met Pro 
85 90 95 

gaa ggt tat gta cag gaa agg acc ate ttc ttc aaa gat gac ggc aac 
Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn 
100 105 110 

tac aag aca cgt get gaa gtc aag ttt gaa ggt gat acc ctt gtt aat 
Tyr bys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 
115 120 125 

aga ate gag tta aaa ggt att gac ttc aag gaa gat ggc aac att ctg 
Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu 
130 3.35 140 

gga cac aaa ttg gaa tac aac tat aac tea cac aat gta tac ate atg 
Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met 
145 150 155 160 

gca gac aaa caa aag aat gga ate aaa gtg aac ttc aag ace cgc cac 
Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys Thr Arg His 
165 170 175 

aac att gaa gat gga age gtt caa eta gca gac cat tat caa caa aat 
Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 
180 185 190 

act cca att ggc gat ggc ect gtc ctt tta cca gac aac eat tac ctg 
Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 
195 200 205 

tec aca caa tct gcc ctt teg aaa gat ccc aac gaa aag aga gac cac 
Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 
210 215 220 

atg gtc ctt ctt gag ttt gta aca get get ggg att aca cat ggc atg 
Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met 
225 230 235 240 

gat gaa ctg tac aac acc ggt atg tct aca ggt cca act get gcc act 
Asp Glu Leu Tyr Asn Thr Gly Met Ser Thr Gly Pro Thr Ala Ala Thr 
245 250 255 

ggc agt aat cga aga ctt cag cag aca caa aat caa gta gat gag gtg 
Gly Ser Asn Arg Arg Leu Gin Gin Thr Gin Asn Gin Val Asp Glu Val 
260 265 270 

gtg gac ata atg ega gtt aac gtg gac aag gtt ctg gaa aga gac cag 
Val Asp He Met Arg Val Asn Val Asp Lys Val Leu Glu Arg Asp Gin 
275 280 285 

aag etc tct gag tta gac gac cgt gca gac gca ctg cag gca ggc get 
Lys Leu Ser Glu Leu Asp Asp Arg Ala Asp Ala Leu Gin Ala Gly Ala 
290 295 300 

tct caa ttt gaa acg age gca gcc aag ttg aag agg aaa tat tgg tgg 



288 



336 



384 



432 



460 



528 



576 



624 



672 



720 



768 



81€ 



864 



912 



960 
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Ser Gin Phe Glu Thr Ser Ala Ala Lys Leu Lys Arg Lys Tyr Trp Trp 
305 310 315 320 

aag aat tgc aag atg tgg gca ate ggg att act gtt ctg gtt ate ttc 1008 
Lys Asn Cys Lys Met Trp Ala lie Gly lie Thr Val Leu Val He Phe 
325 330 335 

ate ate ate ate ate gtg tgg gtt gtc tct tea tgaatgagaa gaaaacgaca 1061 
He He He He He Val Trp Val Val Ser Ser 
340 345 



aaaggctagc 


aaaggagaag 


aactcttcac 


tggagttgtc 


ccaattcttg 


ttgaattaga 


1121 


tggtgatgtt 


aacggccaca 


agttctctgt 


cagtggagag 


ggtgaaggtg 


atgcaacata 


1181 


cggaaaactt 


accctgaagt 


tcatctgcac 


tactggcaaa 


ctgcctgttc 


catggccaac 


1241 


actagtcact 


actctgtgct 


atggtgttca 


atgcttttca 


agatacccgg 


atcatatgaa 


1301 


acggcatgac 


tttttcaaga 


gtgccatgec 


cgaaggttat 


gtacaggaaa 


ggaccatctt 


1361 


cttcaaagat 


gacggcaact 


acaagacacg 


tgctgaagtc 


aagtttgaag 


gtgataccct 


1421 


tgttaataga 


atcgagttaa 


aaggtattga 


cttcaaggaa 


gatggcaaca 


ttctgggaca 


1481 


caaattggaa 


tacaactata 


actcacacaa 


tgtatacatc 


atggcagaca 


aacaaaagaa 


1541 


tggaat caaa 


gtga.acttca 


agacccgcca 


caacattgaa 


gacggaagcg 


uucaaeuagc 




agaccattat 


caacaaaata 


ctccaattgg 


cgatggccct 


gtccttttae 


eagacaacca 


1661 


ttacctgtcc 


acacaatctg 


ccctttcgaa 


agatcccaac 


gaaaagagag 


accacatggt 


1721 


ccttcttgag 


tttgtaacag 


ctgctgggat 


tacacatggc 


atggatgaae 


tgtacaacac 


1781 


cggtatgtct 


ac^ggtccaa 


ctgctgccac 


tggcagtaat 


cgaagactte 


agcagacaca 


1841 


aaatcaagta 


gatgaggtgg 


tggacataat 


gcgagttaac 


gtggacaagg 


ttctggaaag 


1901 


agaccagaag 


ctctctgagt 


tagacgaccg 


tgcagacgca 


ctgeaggcag 


gcgcttctca 


1961 


atttgaaacg 


agcgcagcca 


agttgaagag 


gaaatattgg 


tggaagaatt 


gcaagatgtg 


2021 


ggcaatcggg 


attactgtte 


tggttatctt 


catcatcatc 


atcatcgtgt 


gggttgtctc 


2081 


ttcatga 












2088 



<210> 28 

<211> 347 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
NLS-Fred25-synaptobrevin construct 

<400> 28 

Met Arg Arg Lys Arg Gin Lys Ala Ser Lys Gly Glu Glu Leu Phe Thr 
1 5 10 15 
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Gly Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 
20 25 30 

Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 
35 40 45 

Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 
50 55 60 

Pro Thr Leu Val Thr Thr Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg 
65 70 75 80 

Tyr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Met Pro 
85 90 95 

Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn 
100 105 110 

Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 
115 120 125 

Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu 
130 135 140 

Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met 
145 ISO 155 160 

Ala Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys Thr Arg His 
165 170 175 

Asn lie Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 
180 185 190 

Thr Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 
195 200 205 

Ser Thr Gin Ser Ala- Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 
210 215 220 

Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met 
225 230 235 240 

Asp Glu Leu Tyr Asn Thr Gly Met Ser Thr Gly Pro Thr Ala Ala Thr 
245 250 255 

Gly Ser Asn Arg Arg Leu Gin Gin Thr Gin Asn Gin Val Asp Glu Val 
260 265 270 



Val Asp He Met Arg Val Asn Val 
275 280 

Lys Leu Ser Glu Leu Asp Asp Arg 

290 295 

Ser Gin Phe Glu Thr Ser Ala Ala 
305 310 

Lys Asn Cys Lys Met Trp Ala He 
325 

lie He He He He Val Trp Val 



Asp Lys Val Leu Glu Arg Asp Gin 
285 

Ala Asp Ala Leu Gin Ala Gly Ala 
300 

Lys Leu Lys Arg Lys Tyr Trp Trp 
315 320 

Gly He Thr Val Leu Val He Phe 
330 335 

Val Ser Ser 
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340 345 



<210> 25 
<211> 2106 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (1050) 

<220> 

<223> Description of Artificial Seq[uence : 
NliS-Fred25-cellTibrevin construct 

<400> 29 

atg aga aga aaa cga caa aag get age aaa gga gaa gaa etc ttc act 4 8 

Met Arg Arg Lys Arg Gin Lys Ala Ser Lys Gly Glu Glu Leu Phe Thr 
1 5 10 15 

gga gtt gtc cca att ctt gtt gaa tta gat ggt gat gtt aac ggc cac 96 
Gly Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 
20 25 30 

aag ttc tct gtc agt gga gag ggt gaa ggt gat gca aca tac gga aaa 144 
Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 
35 40' 45 

ctt acc ctg aag ttc ate tgc act act ggc aaa ctg cct gtt cca tgg 192 
Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 
50 55 60 

cca aca eta gtc act act ctg tgc tat ggt gtt caa tgc ttt tea aga 240 
Pro Thr Leu Val Thr Thr Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg 
65 70 75 80. 

tac ceg gat cat atg aaa egg cat gac ttt ttc aag agt gee atg ccc 2 88 
Tyr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Met Pro 
85 90 95 

gaa ggt tat gta cag gaa agg acc ate ttc ttc aaa gat gac ggc aac 336 
Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn 
100 105 110 

tac aag aca cgt get gaa gtc aag ttt gaa ggt gat acc ctt gtt aat 3 84 
Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 
115 120 125 

aga ate gag tta aaa ggt att gac ttc aag gaa gat ggc aac att ctg 432 
Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu 
130 135 140 

gga cac aaa ttg gaa tac aac tat aac tea cac aat gta tac ate atg 4 80 
Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met 
145 150 155 160 

gca gac aaa caa aag aat gga ate aaa gtg aac ttc aag ace cgc cac 52 8 
Ala Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys Thr Arg His 
165 170 175 
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aac att gaa gat gga age gtt caa eta gca gac cat tat caa caa aat 
Asn lie Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 
180 165 190 

act cca att ggc gat ggc cct gtc ctt tta cca gac aac cat tac ctg 
Thr Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 
195 200 205 

tec aca caa tct gcc ctt teg aaa gat ccc aac gaa aag aga gac cac 
Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 
210 215 220 

atg gtc ctt ctt gag ttt gta aca get get ggg att aca cat ggc atg 
Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met 
225 230 235 240 

gat gaa ctg tac aac acc ggt atg tct aca ggt gtg cct teg ggg tea 
Asp Glu Leu Tyr Asn Thr Gly Met Ser Thr Gly Val Pro Ser Gly Ser 
245 250 255 

agt get gcc act ggc agt aat cga aga etc cag cag aca caa aat caa 
Ser Ala Ala Thr Gly Ser Asn Arg Arg Leu Gin Gin Thr Gin Asn Gin 
260 265 270 

gta gat gag gtg gtt gac ate atg aga gtc aat gtg gat aag gtg tta 
Val Asp Glu Val Val Asp He Met Arg Val Asn Val Asp Lys Val Leu 
275 280 285 

gaa aga gac cag aag etc teg gag eta gat gac cgc gca gat gca ctg 
Glu Arg Asp Gin Lys Leu Ser Glu Leu Asp Asp Arg Ala Asp Ala Leu 
290 295 300 

cag gca ggt gcc teg cag ttt gaa aca agt get gee aag ttg aag aga 
Gin Ala Gly Ala Ser Gin Phe Glu Thr Ser Ala Ala Lys Leu Lys Arg 
305 310 315 320 

aag tat tgg tgg aag aac tgc aag atg tgg gcg ata ggg ate agt gtc 
Lvs Tvr Trp Trp Lys Asn Cys Lys Met Trp Ala He Gly He Ser Val 
^ 325 330 335 

ctg gtg ate att gtc ate ate ate ate gtg tgg tgt gtc tct 
Leu Val He He Val He He He He Val Trp Cys Val Ser 
340 345 350 



576 



624 



672 



720 



768 



Bi6 



864 



912 



960 



/1008 



1050 



taaatgagaa 


gaaaacgaca 


aaaggctagc 


aaaggagaag 


aactcttcac 


tggagttgtc 


1110 


ecaattcttg 


ttgaattaga 


tggtgatgtt 


aacggccaca 


agttctctgt 


cagtggagag 


1170 


ggtgaaggtg 


atgeaacata 


cggaaaactt 


accctgaagt 


tcatctgcac 


tactggcaaa 


1230 


ctgcctgttc 


catggccaac 


actagtcact 


actctgtgct 


atggtgttca 


atgcttttca 


1290 


agatacccgg 


atcatatgaa 


acggeatgac 


tttttcaaga 


gtgccatgcc 


cgaaggttat 


1350 


gtacaggaaa ggaecatett 


ettcaaagat 


gacggcaact 


acaagacacg 


tgctgaagte 


1410 


aagtttgaag gtgatacect 


tgttaataga 


atcgagttaa 


aaggtattga 


cttcaaggaa 


1470 


gatggcaaca 


ttctgggaca 


eaaattggaa 


tacaactata 


aetcacacaa 


tgtatacate 


1530 


atggcagaca 


aacaaaagaa 


tggaatcaaa 


gtgaacttca 


agacccgcea 


caacattgaa 


1590 
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gatggaagcg ttcaactagc agaccattat 
gtccttttac cagacaacca ttacctgtcc 
gaaaagagag accacatggt ccttcttgag 
atggatgaac tgtacaacac cggtatgtct 
actggcagta atcgaagact ccagcagaca 
atgagagtca atgtggataa ggtgttagaa 
cgcgcagatg cactgcaggc aggtgcctcg 
agaaagtatt ggtggaagaa ctgcaagatg 
attgtcatca tcatcatcgt gtggtgtgtc 
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caacaaaata ctccaattgg cgatggccct 1650 

acacaatctg ccctttcgaa agatcccaac 1710 

tttgtaacag ctgctgggat tacacatggc 1770 

acaggtgtgc cttcggggtc aagtgctgcc 1830 

caaaatcaag tagatgaggt ggttgacatc 1890 

agagaccaga agctctcgga gctagatgac 1950 

cagtttgaaa caagtgctgc caagttgaag 2010 

tgggcgatag ggatcagtgt cctggtgatc 2 07 0 

tcttaa 2106 



<210> 30 
<211> 350 
<2X2> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; 
NLiS-Fred25-cellubrevin construct 

<400> 30 

Met Arg Arg Lys Arg Gin Lys Ala Ser Lys Gly Glu Glu Leu Phe Thr 
1 5 10 15 

Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 
20 25 30 

Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 
35 40 45 

Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 
50 55 60 

Pro Thr Leu Val Thr Thr Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg 
65 70 75 BO 

Tyr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Met Pro 
85 90 95 

Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn 
100 105 110 

Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 
115 120 125 

Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu 
130 135 140 

Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met 
145 150 155 160 

Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys Thr Arg His 
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165 170 175 

Asn lie Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyx Gin Gin Asn 
180 185 190 

Thr Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr I*eu 
195 200 205 

Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 
210 215 220 

Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met 
225 230 235 240 

Asp Glu Leu Tyr Asn Thr Gly Met Ser Thr Gly Val Pro Ser Gly Ser 
245 250 255 

Ser Ala Ala Thr Gly Ser Asn Arg Arg Leu Gin Gin Thr Gin Asn Gin 
260 265 270 

Val Asp Glu Val Val Asp He Met Arg Val Asn Val Asp Lys Val Leu 
275 280 285 

Glu Arg Asp Gin Lys Leu Ser Glu Leu Asp Asp Arg Ala Asp Ala Leu 
290 295 300 

Gin Ala Gly Ala Ser Gin Phe Glu Thr Ser Ala Ala Lys Leu Lys Arg 
305 310 315 - 320 

Lys Tyr Trp Trp Lys Asn Cys Lys Met Trp Ala He Gly He Ser Val 
325 330 335 

Leu Val He He Val He He He He Val Trp Cys Val Ser 
340 345 350 



<210> 31 
<211> 3171 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (3168) 

<220> 

<223> Description of Artificial Sequence: 
NLS-EYFP-MAPKDM-EBFP construct 

<:400> 31 

atg agg ccc aga aga aag gtg age aag ggc gag gag ctg ttc acc ggg 4 8 

Met Arg Pro Arg Arg Lys Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
1.5 10 15 

gtg gtg ccc ate ctg gtc gag ctg gac ggc gac gta aac ggc cac aag 96 
Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 
20 25 30 

ttc age gtg tec ggc gag ggc gag ggc gat gcc acc tac ggc aag ctg 144 
Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
35 40 45 
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acc 
Thr 


ctg 
Leu 
50 


aag 
Lys 


ttc 
Phe 


ate 
He 


tgc 
Cys 


acc 
Thr 
55 


acc 
Thr 


ggc 

Gly 


aag 
Lys 


ctg 
Leu 


ccc 
Pro 
60 


gtg 
Val 


ccc 
Pro 


tgg 
Trp 


ccc 
Pro 


192 


acc 
Thr 
€5 


etc 
Leu 


gtg 
Val 


acc 
Thr 


acc 
Thr 


ttc 
Phe 
70 


ggc 

Gly 


tac 
Tyr 


ggc 

Gly 


ctg 
Leu 


cag 
Gin 
75 


tgc 
Cys 


ttc 
Phe 


gee 
Ala 


cgc tac 
Arg Tyr 
80 


240 


ccc 
Pro 


gac 
Asp 


cac 
His 


atg 
Met 


aag 
Lys 
85 


cag 
Gin 


cac 
His 


gac 
Asp 


ttc 
Phe 


ttc 
Phe 
90 


aag 
Lys 


tec 
Ser 


gee 
Ala 


atg 
Met 


ccc 
Pro 
95 


gaa 
Glu 


288 


ggc 
Gly 


tac 
Tyr 


gtc 
Val 


cag 
Gin 
100 


gag 
Glu 


cgc 
Arg 


acc 
Thr 


ate 
He 


ttc 
Phe 
105 


ttc 
Phe 


aag 
Lys 


gac 
Asp 


gac 
Asp 


ggc 
Gly 
110 


aac 
Asn 


tac 
Tyr 


336 


aag 
Lye 


ace 
Thr 


cgc 
Arg 
115 


gee 
Ala 


gag 
Glu 


gtg 
Val 


aag 
Lys 


ttc 
Phe 
120 


gag 
Glu 


ggc gac 
Gly Asp 


acc 
Thr 


ctg 
Leu 
125 


gtg 
Val 


aac 
Asn 


cgc 
Arg 


3 84 


ate 
He 


gag 
Glu 
130 


ctg 
Leu 


aag 
Lys 


ggc 
Gly 


ate 
He 


gac 
Asp 
135 


ttc 
Phe 


aag 
Lys 


gag 
Glu 


gac 
Asp 


ggc 
Gly 
140 


aac 
Asn 


ate 
He 


ctg ggg 
Leu Gly 


432 


cac 
His 
145 


aag 
Lys 


ctg 
Leu 


gag 
Glu 


tac 
Tyr 


aac 
Asn 
150 


tac 
Tyr 


aac 
Asn 


age 
Ser 


cac 
His 


aac 
Asn 
155 


gtc 
Val 


tat 
Tyr_ 


ate 
He 


atg 
Met 


gee 
Ala 
160 


480 


gac 
Asp 


aag 
Lys 


cag 
Gin 


aag 
Lys 


aac 
Asn 
165 


ggc 
Gly 


ate 
He 


aag 
Lys 


gtg 
Val 


aac 
Asn 
170 


ttc 
Phe 


aag 
Lys 


ate 
He 


cgc 
Arg 


cac 
His 
175 


aac 
Asn 


528 


ate 
He 


gag 
Glu 


gac 
Asp 


ggc 
Gly 
IBO 


age 
Ser 


gtg 
Val 


cag 
Gin 


etc 
Leu 


gee 
Ala 
185 


gac 
Asp 


cac 
His 


tac 
Tyr 


cag 
Gin 


cag 
Gin 
190 


aac 
Asn 


acc 
Thr 


576 


ccc 
Pro 


ate 
He 


ggc 
Gly 
195 


gac 
Asp 


ggc 
Gly 


ccc 
Pro 


gtg 
val 


ctg 
Leu 
200 


ctg 
Leu 


ccc 
Pro 


gac 

Asp 


aac 
Asn 


cac 
His 
205 


tac 
Tyr 


ctg 
Leu 


age 
Ser 


624 


tac 
Tyr 


cag 
Gin 
210 


tec 
Ser 


gee 
Ala 


ctg 
Leu 


age 
Ser 


aaa 
Lys 
215 


gac 
Asp 


ccc 
Pro 


aac 
Asn 


gag 

Glu 


aag 
Lys 
220 


cgc 
Arg 


gat 
Asp 


cac 
His 


atg 
Met 


672 


gtc 
Val 
225 


ctg 
Leu 


ctg 
Leu 


gag 
Glu 


ttc 
Phe 


gtg 
Val 
230 


ace 
Thr 


gee 
Ala 


gee 
Ala 


ggg 

Gly 


ate 
He 
235 


act 
Thr 


etc 
Leu 


ggc 

Gly 


atg 
Met 


gac 
Asp 
240 


720 


gag 
Glu 


ctg 
Leu 


tac 
Tyr 


aag 
Lys 


aag 
Lys 
245 


gga 
Gly 


gac 
Asp 


gaa 
Glu 


gtg 
Val 


gac 
Asp 
250 


gga gee 
Gly Ala 


gac 
Asp 


etc 
Leu 


agt 
Ser 
255 


ctt 
Leu 


768 


gtg 
Val 


gat 

Asp 


gcg 
Ala 


ttg 
Leu 
260 


aca 
Thr 


gaa 
Glu 


cea 
Pro 


cct 
Pro 


cea 
Pro 
265 


gaa 
Glu 


att 
He 


gag 

Glu 


gga 
Gly 


gaa 
Glu 
270 


ata 
He 


aag 
Lys 


816 


cga 
Arg 


gac 
Asp 


ttc 
Phe 
275 


atg 
Met 


get 
Ala 


gcg 
Ala 


ctg 
Leu 


gag 
Glu 
280 


gea 
Ala 


gag 
Glu 


ccc 
Pro 


tat 
Tyr 


gat 
Asp 
285 


gac 
Asp 


ate 
He 


gtg 
Val 


8 64 
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gga gaa act gtg gag aaa act gag ttt att cct etc ctg gat ggt gat 912 
Gly Glu Thr Val Glu Lys Thr Glu Phe lie Pro Leu Leu Asp Gly Asp 
290 295 300 



gag aaa acc ggg aac tea gag tec aaa aag aaa ccc tgc tta gac act 

Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr 
305 310 315 320 

age cag gtt gaa ggt ate cea tct tct aaa cca aca etc eta gcc aat 

Ser Gin Val Glu Gly lie Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn 

325 330 335 

ggt gat cat gga atg gag ggg aat aac act gea ggg tct eca act gac 

Gly Asp His Gly Met Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp 

340 345 350 



cca aac tct cca tgt tea gag tec tgt gtc tec cca gag gtt act ata 
Pro Asn Ser Pro Cys Ser Glu Ser Cys Val Ser Pro Glu Val Thr He 
450 455 460 

gaa acc eta cag cca gea aca gag etc tec aag gea gca gaa gtg gaa 
Glu Thr Leu Gin Pro Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu 
465 470 475 480 



cca gac aca gag gca gea ctg get aaa gac ata gaa gag ate acc aag 

Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp He Glu Glu He Thr Lys 

515 520 . 525 

cca gat gtg ata ttg gca aat gtc acg cag eca tct act gaa teg gat 



960 



1008 



1056 



ttc ctt gaa gag aga gtg gac tat ccg gat tat cag age age cag aac 1104 
Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp Tyr Gin Ser Ser Gin Asn 
355 360 365 

tgg cca gaa gat gca age ttt tgt ttc cag cct cag caa gtg tta gat li52 
Trp Pro Glu Asp Ala Ser Phe Cys Phe Gin Pro Gin Gin Val Leu Asp 
370 375 380 

act gac cag get gag cec ttt aac gag eac cgt gat gat ggt ttg gca 1200 
Thr Asp Gin Ala Glu Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala 
385 390 395 400 

gat ctg etc ttt gtc tec agt gga ccc acg aac get tct gca ttt aca 124 8 
Asp Leu Leu Phe Val Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr 
405 410 415 

gag cga gac aat cct tea gaa gac agt tac ggt atg ctt ccc tgt gac 1296 
Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp 
420 425 430 

tea ttt get tec acg get gtt gta tct cag gag tgg tct gtg gga gcc 1344 
Ser Phe Ala Ser Thr Ala Val Val Ser Gin Glu Trp Ser Val Gly Ala 
435 440 445 



1392 



1440 



tea gtg aaa gag cag ctg cca get aaa gca ttg gaa acg atg gca gag 148 8 
Ser Val Lys Glu Gin Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu 
485 490 495 

cag acc act gat gtg gtg eac tct eca tec aca gac aca aca cea ggc 1536 
Gin Thr Thr Asp Val Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly 
500 505 510 



1584 



1632 
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Pro Asp Val lie Leu Ala Asn Val Thr Gin Pro Ser Thr Glu Ser Asp 
530 535 540 



610 



cca gcc aag ggc atg gtt tea etc tea gaa ata gaa gag get ctg gca 
Pro Ala Lys Gly Met Val Ser Leu Ser Glu lie Glu Glu Ala Leu Ala 

630 635 640 



625 



1680 



atg ttc ctg gcc cag gac atg gaa eta etc aca gga aca gag gca gcc 
Met Phe Leu Ala Gin Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala 
545 S50 555 560 

cac get aae aat ate ata ttg cct aca gaa cea gac gaa tet tea ace 
His Ala Aan Asn lie lie Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr 
565 570 575 

aag gat gta gca cea cct atg gaa gaa gaa att gtc cca ggc aat gat 
Lys Asp Val Ala Pro Pro Met Glu Glu Glu He Val Pro Gly Asn Asp 
580 585 590 

acg aca tee ecc aaa gaa aca gag aca aca ctt cca ata aaa atg gac 
Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr Leu Pro He Lys Met Asp 
595 600 605 

ttg gca cca cct gag gat gtg tta ctt ace aaa gaa aca gaa eta gcc 1872 
Leu Ala Pro Pro Glu Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala 

615 620 



1728 



1776 



1824 



1920 



aag aat gat gtt cgc tet gca gaa ata cct gtg get cag gag aca gtg 1968 
Lys Asn Asp Val Arg Ser Ala Glu He Pro Val Ala Gin Glu Thr Val 
645 650 655 



20i6 



2064 



gtc tea gaa aca gag gtg gtc ctg gca aca gaa gtg gta ctg ecc tea 
Val Ser Glu Thr Glu Val Val Leu Ala Thr Glu Val Val Leu Pro Ser 
660 665 670 

gat ecc ata aca aca ttg aca aag gat gtg aca etc ecc tta^ gaa gca 
Asp Pro He Thr Thr Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala 
675 680 685 

gag aga ccg ttg gtg acg gac atg act cca tet ctg gaa aca gaa atg 2112 
Glu Arg Pro Leu Val Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met 
690 695 700 

ace eta ggc aaa gag aca get cca ecc aca gaa aca aat ttg ggc atg 2160 
Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met 
705 710 715 720 

gee aaa gac atg tet cea etc cca gaa tea gaa gtg act ctg ggc aag 2208 
Ala Lys Asp Met Ser Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys 
725 730 735 

gac gtg gtt ata ctt cca gaa aca aag gtg get gag ttt aae aat gtg 2256 
Asp Val Val He Leu Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val 
740 745 750 

act cea ctt tea gaa gaa gag gta ace tea gtc aag gac atg tet ccg 2304 
Thr Pro Leu Ser Glu Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro 
755 760 765 

tet gca gaa aca gag get ecc ctg get aag aat get gat ctg cac tea 2352 
Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser 
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770 775 780 

gga aca gag ctg att gtg gac aac age atg get cca gcc tec gat ctt 2400 
Gly Thr Glu Leu lie Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu 
785 790 795 800 

gca ctg ccc ttg gaa aca aaa gta gca aca gtt cca att aaa gac aaa 2448 
Ala Leu Pro Leu Glu Thr Lys Val Ala Thr Val Pro He Lys Asp Lys 
805 810 815 

gga atg gtg age aag ggc gag gag ctg ttc ace ggg gtg gtg ccc ate 2496 
Gly Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He 
820 825 830 

ctg gtc gag ctg gac ggc gac gta aac gge cac aag ttc age gtg tec 2544 
Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser 
835 840 845 

ggc gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc 2592 
Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe 
850 855 860 

ate tgc acc ace ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc 264 0 
He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr 
865 870 875 880 

acc' ctg acc cac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg 2688 
Thr Leu Thr His Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met 
685 890 895 

aag cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag 2736 
Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin 
900 905 910 

gag cgc acc ate ttc ttc aag gac gac ggc aac tac aag ace cgc gcc 2784 
Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala 
915 920 925 ^ 

gag gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag 2 832 
Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys 
930 935 940 

ggc ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag 2 880 
Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu 
945 950 955 960 

tac aac ttc aac age cac aac gtc tat ate atg gcc gac aag cag aag 2928 
Tyr Asn Phe Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys 
965 970 975 

aac ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc 2 976 
Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly 
980 985 990 

age gtg cag etc gcc gac cac tac cag cag aac acc ccc ate ggc gac 3 024 
Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp 
995 1000 1005 

ggc ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gcc 3 072 
Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala 
1010 1015 1020 
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ctg age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag 3120 

Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu 
1025 1030 1035 1040 

ttc gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag 316B 

Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
1045 1050 1055 

tag 3171 



<210> 32 
<211> 1056 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
NLS-EYFP-MAPKDM-EBFP construct 

<400> 32 

Met Arg Pro Arg Arg Lys Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
1 5 10 15 

Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 
20 25 30 

Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
35 40 45 

Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 
50 55 60 

Thr Leu Val Thr Thr Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr 
65 70 75 80 



Pro Asp His Met Lys Gin His Asp 
85 

Gly Tyr Val Gin Glu Arg Thr lie 
100 

Lys Thr Arg Ala Glu Val Lys Phe 
115 120 

lie Glu Leu Lys Gly lie Asp Phe 
130 135 



Phe Phe Lys Ser Ala Met Pro Glu 
90 95 

Phe Phe Lys Asp Asp Gly Asn Tyr 
105 110 

Glu Gly Asp Thr Leu Val Asn Arg 
125 

Lys Glu Asp Gly Asn He Leu Gly 
140 

Met Ala 
160 



His Lys Leu Glu Tyr Asn Tyr 
145 150 

Asp Lys Gin Lys Asn Gly He 
165 

He Glu Asp Gly Ser Val Gin 
180 

Pro He Gly Asp Gly Pro Val 
195 



Asn Ser His Asn Val Tyr He 
155 

Lys Val Asn Phe Lys He Arg 
170 

Leu Ala Asp His Tyr Gin Gin 
185 190 

Leu Leu Pro Asp Asn His Tyr 
200 205 



His Asn 
175 

Asn Thr 
Leu Ser 
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Tyr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
210 215 220 

Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 
225 230 235 ^ 240 

Glu Leu Tyr Lys Lys Gly Asp Glu Val Asp Gly Ala Asp Leu Ser Leu 
245 250 255 

Val Asp Ala Leu Thr Glu Pro Pro Pro Glu He Glu Gly Glu He Lys 
260 265 270 

Arg Asp Phe Met Ala Ala Leu Glu Ala Glu Pro Tyr Asp Asp lie Val 
275 280 285 

Gly Glu Thr Val Glu Lys Thr Glu Phe He Pro Leu Leu Asp Gly Asp 
290 295 300 

Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr 
305 310 315 320 

Ser Gin Val Glu Gly- He Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn 
325 . 330 335 

Gly Asp His Gly Met Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp 
340 345 350 

Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp Tyr Gin Ser Ser Gin Asn 
355 360 - 365 

Trp Pro Glu Asp Ala Ser Phe Cys Phe Gin Pro Gin Gin Val Leu Asp 
370 375 380 

Thr Asp Gin Ala Glu Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala 
385 390 395 40O 

Asp Leu Leu Phe Val Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr 
405 410 415 

Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp 
420 425 430 

Ser Phe Ala Ser Thr Ala Val Val Ser Gin Glu Trp Ser Val Gly Ala 
435 440 445 

Pro Asn Ser Pro Cys Ser Glu Ser Cys Val Ser Pro Glu Val Thr He 
450 455 460 

Glu Thr Leu Gin Pro Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu 
465 470 475 480 

Ser Val Lys Glu Gin Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu 
485 490 495 

Gin Thr Thr Asp Val Val His Ser Pro Ser Thr Asp Thr , Thr Pro Gly 
500 505 510 

Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp He Glu Glu He Thr Lys 
515 520 525 

Pro Asp Val He Leu Ala Asn Val Thr Gin Pro Ser Thr Glu Ser Asp 
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530 535 540 

Met Phe Leu Ala Gin Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala 
545 550 555 560 

His Ala Asn Asn lie lie Leu Pro Thr Glu Pro Asp Glu Ser Ser Tiir 
565 570 575" 

Lys Asp Val Ala Pro Pro Met Glu Glu Glu lie Val Pro Gly Asn Asp 
580 585 590 

Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr Leu Pro lie Lys Met Asp 
595 600 605 

Leu Ala Pro Pro Glu Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala 
610 615 620 

Pro Ala Lys Gly Met Val Ser Leu Ser Glu lie Glu Glu Ala Leu Ala 
625 630 635 640 

Lys Asn Asp Val Arg Ser Ala Glu lie Pro Val Ala Gin Glu Thr Val 
645 650 655 

Val Ser Glu Thr Glu Val Val Leu Ala Thr Glu Val Val Leu Pro Ser 
660 665 670 

Asp Pro He Thr Thr Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala 
675 680 685 

Glu Arg Pro Leu Val Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met 
690 695 700 

Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met 
705 710 715 720 

Ala Lys Asp Met Ser Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys 
^ 725 730 735 

Asp Val Val He Leu Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val 
740 745 750 

Thr Pro Leu Ser Glu Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro 
755 760 765 

Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser 
770 775 780 

Gly Thr Glu Leu He Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu 
785 790 795 BOO 

Ala Leu Pro Leu Glu Thr Lys Val Ala Thr Val Pro He Lys Asp Lys 
805 810 815 

Gly Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He 
820 825 830 

Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser 
835 840 845 

Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe 
850 855 860 
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lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr 
865 870 875 8B0 

Thr Leu Thr His Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met 
885 890 895 

Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin 
900 905 910 

Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala 
915 920 925 

Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys 
930 935 940 

Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu 
945 950 955 960 

Tyr Asn Phe Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys 
965 970 975 

Asn Gly He Lys Val Asn Ehe Lys He Arg His Asn He Glu Asp Gly 
980 985 990 

Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp 
995 1000 1005 

Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala 
1010 1015 1020 

Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu 
1025 1030 1035 1040 

Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
/ 1045 1050 1055 



<210> 33 

<211> 1623 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (1623) 

<220> 

<223> Description of Artificial Sequence: 

yFP-NIiS-CP3 -multiple DEVD-CFP-Annexin II construct 

<400> 33 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 4 8 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag 99^: gag ggc gat gee acc tac ggc aag ctg acc ctg aag ttc ate 144 
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Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys hen Thr Xieu Lys Phe He 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ttc ggc tac ggc ctg cag tgc ttc gee cgc tac ccc gac cac atg aag 24 0 
Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gcc gac aag cag aag aac 4 80 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag^ etc gcc gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro'^Ile Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age tac cag tec gcc ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag tec 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

gga aga agg aaa cga caa aag cga teg gca ggt gac gaa gtt gat gca 7 68 
Gly Arg Arg Lys Arg Gin Lys Arg Ser Ala Gly Asp Glu Val Asp Ala 
245 250 255 

ggt gac gaa gtt gat gca ggt gac gaa gtt gat gca ggt gac gaa gtt 816 
Gly Asp Glu Val Asp Ala Gly Asp Glu Val Asp Ala Gly Asp Glu Val 
260 265 270 

gac gca ggt agt act atg gtg age aag ggc gag gag ctg ttc acc ggg 864 
Asp Ala Gly Ser Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
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275 280 285 

gtg gtg ccc ate ctg gtc gag ctg gac ggc gac gta aac ggc cac aag 912 

Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 

290 295 300 

ttc age gtg tec ggc gag ggc gag ggc gat gcc acc tac ggc aag ctg 

Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 

305 310 315 320 

acc ctg aag ttc ate tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc 

Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 
325 330 335 



ccc gac cac atg aag cag cac gac ttc ttc aag tec gcc atg ccc gaa 
Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 
355 360 365 



aag acc cgc gcc gag gtg aag ttc gag ggc gac acc ctg gtg aac cgc 
Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 
385 390 395 400 



435 440 445 



450 455 460 

ccc ate ggc gac ggc ccc gtg ctg ctg ccc gac aac cac tac ctg age 

Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 

465 470 475 4B0 

acc cag tec gcc ctg age aaa gac ccc aac gag aag cgc gat cac atg 

Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 

485 490 495 



960 



1006 



acc etc gtg acc acc ctg acc tgg ggc gtg cag tgc ttc age cgc tac 1056 
Thr Leu Val Thr Thr Leu Thr Trp Gly Val Gin Cys Phe Ser Arg Tyr 
340 345 350 



1104 



ggc tac gtc cag gag cgc acc ate ttc ttc aag gac gac ggc aac tac 1152 
Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr 
370 375 380 



1200 



ate gag ctg aag ggc ate gac ttc aag gag gac ggc aac ate ctg ggg 1248 

He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn lie Leu Gly 
405 410 415 

cac aag ctg gag tac aac tac ate age cac aac gtc tat ate acc gcc 

His Lys Leu Glu/Tyr Asn Tyr He Ser His Asn Val Tyr He Thr Ala 

420 425 / 430 



1296 



gac aag cag aag aac ggc ate aag gcc aac ttc aag ate cgc cac aac 1344 
Asp Lys Gin Lys Asn Gly He Lys Ala Asn Phe Lys He Arg His Asn 



ate gag gac ggc age gtg cag etc gcc gac cac tac cag cag aac acc 1392 
He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 



1440 



1488 



gtc ctg ctg gag ttc gtg acc gcc gcc ggg ate act etc ggc atg gac 1536 

Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 
500 505 510 

gag ctg tac aag atg tet act gtc cac gaa ate ctg tgc aag etc age 1584 

Glu Leu Tyr Lys Met Ser Thr Val His Glu He Leu Cys Lys Leu Ser 
515 " 520 525 
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ttg gag ggt gtt cat tct aca ccc 
Leu Glu Gly Val His Ser Thr Pro 
530 535 



cca agt gcc gga tec 
Pro Ser Ala Gly Ser 
540 



<210> 34 
<211> 541 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

yFP-NIiS-CP3 -multiple DEVD-CFP-Annexin II construct 

<400> 34 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr lie Phe PJae Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 . 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leii Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 
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Gly Arg Arg Lys Arg 
245 

Gly Asp Glu Val Asp 
260 

Asp Ala Gly Ser Thr 
275 

Val Val Pro lie JJeu 
290 

Phe Ser Val Ser Gly 
305 

Thr Leu Lys Phe lie 
325 

Thr Leu Val Thr Thr 
340 

Pro Asp His Met Lys 
355 

Gly Tyr Val Gin Glu 
370 

Lys Thr Arg Ala Glu 
3B5 

He Glu Leu Lys Gly 
405 

His Lys Leu Glu Tyr 
420 

Asp Lys Gin Lys Asn 
435 

He Glu Asp Gly Ser 
450 

Pro He Gly Asp Gly 
465 

Thr Gin Ser Ala Leu 
465 

Val Leu Leu Glu Phe 
500 

Glu Leu Tyr Lys Met 
515 

Leu Glu Gly Val His 
530 



Gin Lys Arg Ser Ala Gly 
250 

Ala Gly Asp Glu Val Asp 
265 

Met Val Ser Lys Gly Glu 
280 

Val Glu Leu Asp Gly Asp 
295 

Glu Gly Glu Gly Asp Ala 
310 315 

Cys Thr Thr Gly Lys Leu 
330 

Leu Thr Trp Gly Val Gin 
345 

Gin His Asp Phe Phe Lys 
360 

Arg Thr He Phe Phe Lys 
375 

Val Lys Phe Glu Gly Asp 
390 - 395 

He Asp Phe Lys Glu Asp 
410 

Asn Tyr He Ser His Asn 
425 

Gly He Lys Ala Asn Phe 
440 

Val Gin Leu Ala Asp His 
455 

Pro Val Leu Leu Pro Asp 
470 475 

Ser Lys Asp Pro Asn Glu 
490 

Val Thr Ala Ala Gly He 
505 

Ser Thr Val His Glu He 
520 

Ser Thr Pro Pro Ser Ala 
535 



Asp Glu Val Asp Ala 
255 

Ala Gly Asp Glu Val 
270 

Glu Leu Phe Thr Gly 
285 

Val Asn Gly His Lys 
300 

Thr Tyr Gly Lys Leu 
320 

Pro Val Pro Trp Pro 
335 

Cys Phe Ser Arg Tyr- 
350 

Ser Ala Met Pro Glu 
365 

Asp Asp Gly Asn Tyr 
380 

Thr Leu Val Asn Arg 
400 

Gly Asn He Leu Gly 
415 

Val Tyr He Thr Ala 
430 

Lys He Arg His Asn 
445 

Tyr Gin Gin Asn Thr 
460 

Asn His Tyr Leu Ser 
480 

Lys Arg Asp His Met 
495 

Thr Leu Gly Met Asp 
510 

Leu Cys Lys Leu Ser 
525 

Gly Ser 
540 



<210> 35 
<211> 24 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FIiAG epitope 
<400> 35 

gactacaaag acgacgacga caaa 



<210> 36 
<211> 8 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: FLAG epitope 
<400> 36 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 



<210> 37 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HA epitope 
<400> 37 

tacccatacg acgtaccaga ctacgca 



<210> 38 ^ 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HA epitope 
<400> 38 

Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
15 



<210> 39 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: KT3 epitope 
<400> 39 

ccaccagaac cagaaaca 18 



<210> 40 
<211> 6 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: KT3 epitope 
<400> 40 

Pro Pro Glu Pro Glu Thr 
1 5 



<210> 41 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Myc epitope 
<400> 41 

gcagaagaac aaaaattaat aagcgaagaa gactta 36 

<210> 42 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Myc epitope 



<400> 42 

"Ala~Glu~Glu Gin" Lys Leu lie Sef Glu Glu Asp Leu 
15 10 



<:210> 43 
<211> 717 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 
<222> (1) . . (717) 

<220> 

<223> Description of Artificial Sequence: EYFP 
<400> 43 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 



48 



gag ggc gag ggc gat gee acc tac ggc aag ctg acc ctg aag ttc ate 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 



144 
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tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ttc ggc tac ggc ctg cag tgc ttc gcc cgc tac ccc gac cac atg aag 24 0 
Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 . 75 80 

cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtb cag gag 28 8 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
B5 90 95 

cgc acc ate ttc ttc aag gac gac ggc aae tac aag acc cgc gcc gag 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gcc gac aag cag aag aac 4 80 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 52 8 
Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

gtg cag etc gcc gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age tac cag tec gcc ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag 717 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

<210> 44 
<211> 239 
<212> PRT 

<213> Artificial .Sequence 
<220> 

<223> Description of Artificial Sequence: EYFP 
<400> 44 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 
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Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 45 
<211> 717 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<:222> (1).,{717) 

<220> 

<223> Description of Artificial Sequence: EGFP 
<400> 45 

atg gtg age aag ggc gag gag ctg ttc ace ggg gtg gtg ccc ate ctg 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 
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Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc ate 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 



tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

ctg acc tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

65 70 75 80 

cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 



192 



240 



288 



cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag^gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 

-iin 1*^*^ 140 



480 



130 135 140 

aac tac aac age cac aac gtc tat ate atg gee gac aag cag aag aac 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly lie Lys Val Asn Phe Lys lie Arg His Asn He Glu Asp Gly Ser 
165 179 175- 

gtg cag etc gee gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
ISO 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gee ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gee ggg ate act etc ggc atg gac gag ctg tac aag 717 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 46 . 

<211> 239 

<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: EGPP 
<400> 46 . 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly Hie Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Glri Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu' Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 ' ^ 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asi; Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr AJa Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<2iO> 47 

<211> 717 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (717) 

<220> 

<223> Description of Artificial Sequence: EBFP 
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<400> 47 

atg gtg age aag ggc gag gag. ctg ttc acc ggg gtg gtg ccc ate ctg 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 



gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 



ctg acc cac ggc gtg cag tgc ttc age cge tac ccc gac cac atg aag 
Leu Thr His Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

70 75 80 



65 



.cge acc ate ttc ttc aag gac gac ggc aac tac aag acc cge gee gag 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu. 
100 105 110 



145 



150 155 160 



gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag 
Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



96 



gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc ate 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 



240 



cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



336 



gtg aag ttc gag ggc gac acc ctg gtg aac cge ate gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac ttc aac age cac aac gtc tat ate atg gcc gac aag cag aag aac 
Asn Phe Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 



480 



ggc ate aag gtg aac ttc aag ate cge cac aac ate gag gac ggc age 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gcc gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gcc ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cge gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 



717 



78 



wo 00/50872 PCT/USOO/04794 



<210> 48 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: EBFP 
<400> 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 IS 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr His Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Phe Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 1S5 160 

Glv He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser 4la Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 . 235 



<2I0> 49 
<211> 717 
<:212> DNA 

<213> Artificial Sequence 
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<220> 

<221> CDS 

<222> (1) . . (717) 

<220> 

<223> Description of Artificial Sequence: ECFP 
<400> 49 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 4 8 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gee ace tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc acc acc ggc aag ctg cce gtg ccc tgg ccc acc etc gtg acc aec 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg acc tgg ggc gtg cag tgc tte age cgc tac ccc gac cae atg aag 240 
Leu Thr Trp Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tee gee atg ecc gaa ggc tac gtc eag gag 2 88 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag ace ege gee gag 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu^ 
100 105 110 

gtg aag ttc gag ggc gac ace ctg gtg aac cgc ate gag ctg aag ggc 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ate gac tte aag gag gac ggc aac ate ctg ggg cae aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac ate age cac aac gtc tat ate acc gee gac aag cag aag aac 4 80 
Asn Tyr He Ser His Asn Val Tyr He Thr Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gee aac ttc aag ate cgc cac aac ate gag gac ggc age 52 8 
Gly He Lys Ala TVsn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gee gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age aec cag tec gee ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 
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age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag 717 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 50 
<211>.239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ECFP 
<400> 50 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 ' 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Trp Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 BO 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly A^n Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu . Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly. Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr He Ser His Asn Val Tyr He Thr Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Ala Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
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225 230 235 



<210> 51 
<211> 720 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (717) 

<220> 

<223> Description of Artificial Secpience: Fred25 
<400> 51 

atg get age aaa gga gaa gaa etc ttc act gga gtt gtc cca att ctt 48 
Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

gtt gaa tta gat ggt gat gtt aac g'gc cac aag ttc tct gtc agt gga 9& 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc act act ggc aaa ctg ect gtt cca tgg cca aca eta gtc act act 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg tgc tat ggt gtt caa tgc ttt tea aga tac ecg gat cat atg aaa 240 
Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

egg cat gae ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa 288 
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

agg acc ate ttc ttc aaa gat gae ggc aac tac aag aca cgt get gaa 33 6 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtc aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

att gae ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tat aac tea cac aat gta tac ate atg gca gae aaa caa aag aat 4 80 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala TVsp Lys Gin Lys Asn 
145 150 155 160 

gga ate aaa gtg aac "ttc aag acc cgc cac aac att gaa gat gga age 528 
Gly lie Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtt caa eta gca gae cat tat caa caa aat act cca att ggc gat ggc 576 
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Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

cct gtc ctt tta cca gac aac cat tac ctg tec aca caa tot gcc ctt 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

teg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672 
ser Lys Asp Pro Asn Glu Lye Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gta aca get get ggg att aca cat ggc atg gat gaa ctg tac aac tag 72 0 
Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn 
225 230 235 



<210> 52 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Freci25 

<400> 52 , . ^ ^1 X 

Met Ala ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 - 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Arq His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 
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Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Asn 
225 230 235 



<210> 53 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-1 , 4 , 5 
substrate recognition sequence 

<400> 53 
tgggaacatg acaa 



<210> 54 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-1 , 4, 5 
substrate recognition sequence 

<400> 54 
T2TP Glu His Asp 
1 



<210> 55 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-1 
substrate recognition sequence 

<4O0> 55 
tggtttaaag ac 



<210> 56 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-1 
substrate recognition sequence 

<400> 56 

Trp Phe Lys Asp 
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1 



<210> 57 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of J\rtificial Sequence: Caspase-2 
substrate recognition sequence 

<400> 57 

gacgaacacg ac 12 



<210> 58 
<211> 4 
<212> PRT 

<;213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-2 
substrate recognition sequence 

<400> 58 
Asp Glu His Asp 
1 



<210> 59 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-3,7 
substrate recognition sequence 

<400> 59 

gacgaagttg ac 12 



<210> 60 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-3,7 
substrate recognition sequence 

<400> 60 
Asp Glu Val Asp 
1 



<210> 61 

<211> 12 

<212> DNA 

<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: proCaspase-3 
substrate recognition sequence 

<400> 61 
atagaaacag ac 



<210> 62 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-3 
substrate recognition sequence 

<400> 62 
lie Glu Thr Asp 
1 



<210> 63 
<211> 12 
<212> DNA 

<213> Artificial Sec[uence 
<220> 

<223> Description of Artificial Sequence: proCaspase-4 , 5 
substrate recognition sequence 

<400> 63 

tgggtaagag ac 12 



<210> 64 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-4 , 5 
substrate recognition sequence 

<400> 64 
Trp Val Arg Asp 
1 



<210> 65 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 

<:400> 65 

gtagaaatag ac 12 
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<210> 66 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 

<400> 66 
Val Glu lie Asp 
1 



<210> 67 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-e 
substrate recognition sequence 

<400> 67 
gtagaacacg ac 



<210> 68 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition seqiience 

<400> 68 
Val Glu His Asp 
1 



<210> 69 

<211> 12 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-6 
substrate recognition sequence 

<400> 69 

acagaagtag ac ^2 



<210> 70 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: proCaspase-e 
substrate recognition sequence 

<400> 70 
Thr Glu Val Asp 
1 



<210> 71 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; proCaspase-? 
substrate recognition sequence 

<400> 71 
atacaagcag ac 



<210> 72 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase- 7 
substrate recognition sequence 

<400> 72 
lie Gin Ala Asp 
1 



<210> 73 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-8 
substrate recognition sequence 

<400> 73 
gtagaaacag ac 



<210> 74 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-B 
substrate recognition sequence 

<400> 74 
Val Glu Thr Asp 
1 
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<210> 75 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Seqfuence : proCaspase-8 
substrate recognition sequence 

<400> 75 
ttagaaacag ac 



<210> 76 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-8 
substrate recognition sequence 

<400> 76 
Leu Glu Thr Asp 
1 



<210> 77 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CaspaBe-9 
substrate recognition sequence 

<400> 77 
ttagaacacg ac 



<210> 78 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-9 
substrate recognition sequence 

<400> 78 
Leu Glu His Asp 
1 



<210> 79 

<211> 12 

<212> DNA 

<213> Artificial 



Sequence 
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<220> ^ 
<223> Description of Artificial Sequence: proCaspai 
siibstrate recognition sequence 

<400> 79 
ttagaacacg ac 



<210> BO 

<211> 4 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : proCaspai 
substrate recognition sequence 



<400> 80 
Leu Glu His Asp 
1 



<210> 81 
<211> 12 
<212> DNA 

c213> Artificial Sequence 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 81 3^2 
agccaaaatt ac 



<210> 82 
<211> 4 
<212> PRT 

<;213> Artificial Sequence 

<223> Description of Artificial Sequence: HIV prote 
substrate recognition sequence 



<400> 82 
Ser Gin Asn Tyr 
1 



<210> 83 
<211> 12 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 83 
ccaatagtac aa 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-2 
substrate recognition sequence 

<400> 57 

gacgaacacg ac 12 



c210> 56 
<211> 4 
<212> PRT 

<213> /artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-2 
substrate recognition sequence 

<400> 58 

Asp Glu His Asp 

1 



<210> 59 

<211> 12 - ^ 

<212> DNA 

<213> Artificial Secjuence 
<220> 

<223> Description of Artificial Sequence: Caspase-3,7 
substrate recognition sequence 

<400> 59 

gacgaagttg ac 12 



<210> 60 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-3,7 
sLibstrate recognition sequence 

<400> 60 
Asp Glu Val Asp 
1 



<210> 61 

<211> 12 

<212> DNA 

<213> Artificial 

<220> 



Sequence 
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<223> Description of Artificial Sequence: proCaspase-3 
substrate recognition sequence 

<400> 61 
atagaaacag ac 



<210> 62 

<211> 4 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-S 
substrate recognition sequence 

<400> 62 
lie Glu Thr Asp 
1 



<210> 63 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-4 , 5 
substrate recognition sequence 

<400> 63 
tgggtaagag ac 



<:210> 64 / 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-4 , 5 
substrate recognition sequence 

<400> 64 
Trp Val Arg Asp 
1 



<210> 65 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
siobstrate recognition sequence 
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<400> 65 

gtagaaatag ac 12 

<210> 66 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 

<400> 66 
Val Glu lie Asp 
1 



<210> 67 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 

<400> 67 

gtagaacacg ac 12 



<210> 68 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 



<400> 68 
Val Glu His Asp 
1 



<210> 69 

<211> 12 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: proCaBpase-6 
substrate recognition sequence 

<400> 69 

acagaagtag ac 12 
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<210> 70 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-6 
substrate recognition sequence 

<400> 70 
Thr Glu Val Asp 
1 



<210> 71 
<211> 12 
<212> DWA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaBpase-7 
substrate recognition sequence 

<400> 71 

atacaagcag ac _ 12 



<210> 72 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-7 
substrate recognition sequence 

<400> 72 
lie Gin Ala Asp 
1 



<210> 73 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-B 
substrate recognition sequence 

<400> 73 

gtagaaacag ac 12 



<210> 74 
<211> 4 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-8 
substrate recognition sequence 

<400> 74 
Val Glu Thr Asp 
1 



<210> 75 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-8 
substrate recognition sequence 

<400> 75 
ttagaaacag ac 



<2i0> 76 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-8 
substrate recognition sequence 

<400> 76 
Leu Glu Thr Asp 
1 



<210> 77 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-9 
substrate recognition sequence 

<400> 77 
ttagaacacg ac 



<210> 76 

<211> 4 

<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Caspase-9 
substrate recognition sequence 

<400> 78 
Leu Glu His Asp 
1 



<210> 79 

<211> 12 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: proCaspase-9 
substrate recognition sequence 

<400> 79 
ttagaacacg ac 



<210> 80 
<211> 4 
<212> PRT 

<213> Artificial Sec[uence 
<220> 

<223> Description of Artificial Sequence: proCaspase-9 
,substrate recognition sequence 

<:400> 80 

Leu Glu His Asp 



<210> 61 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 81 
agccaaaatt ac 



<210> 82 
<2X1> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HXV protease 
substrate recognition sequence 
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<400> 82 
Ser Gin Asn Tyr 
1 



<210> 83 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 83 
ccaatagtac aa 



<210> 84 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 84 
Pro lie Val Gin 
1 



<210> 85 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Adenovirus 
endopeptidase substrate recognition sequence 

<400> 85 
atgtttggag ga 



<210> 86 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Adenovirus 
endopeptidase substrate recognition sequence 

<400> 86 

Met Phe Gly Gly 
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1 



<210> 87 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Adenovirus 
endopeptidase substrate recognition sequence 

<400> 87 

gcaaaaaaaa ga 12 



<210> 88 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Adenovirus 
endopeptidase substrate recognition sequence 

<400> 88 
Ala Lys Lys Arg 
1 



<210> 89 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: b-Secretasc 
substrate recognition secjuence 

<400> 89 
gtgaaaatg 



<210> 90 
<211> 3 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sec[uence: b-Secretase 
substrate recognition sequence 

<400> 90 
Val Lys Met 
1 



98 



WO00/S0872 



PCT/USOO/04794 



<210> 91 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: b-Secretase 
substrate recognition sequence 

<400> 91 

gacgcagaat tc 12 



<210> 92 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: b-Secretase 
substrate recognition sequence 

<400> 92 
Asp Ala Glu Phe 
1 



<210> 93 
c211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cathepsin D 
^substrate recognition sequence 

<400> 93 

aaaccagcat tattc 15 



<210> 94 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cathepsin D 
substrate recognition sequence 

<400> 94 

Lys Pro Ala Leu Phe 
1 5 



<210> 95 
<211> 9 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secfuence: Cathepsin D 
substrate recognition sequence 

<400> 95 

ttcagatta 9 



<210> 96 
<211> 3 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Cathepsin D 
substrate recognition sequence 

<400> 96 
Phe Arg Leu 
1 



<210> 97 
<211> 15 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial 
Metalloprotease substrate 

<400> 97/ 
ggaccattag gacca 



Sequence : Matrix 
recognition sequence 

/ .15 



<210> 98 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Matrix 

Metalloprotease substrate recognition sequence 

<400> 98 

Gly Pro Leu Gly Pro 
1 5 



<210> 99 

<211> 12 

<212> DNA 

<213> Artificial 

<220> 



Sequence 



100 
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<223> Description of Artificial Sequence: Granzyme B 
substrate recognition sequence 

<400> 99 
atagaaccag ac 



<210> 100 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Granzyme B 
substrate recognition sequence 

<400> 100 
lie Glu Pro Asp 
1 



<210> 101 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Anthrax 
protease substrate recognition sequence 

<400> 101 

atgcccaaga agaagccgac gcccatccag ctgaac 



<210> 102 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secjuence: Anthrax 
protease substrate recognition sequence 

<400> 102 

Met Pro Lys Lys Lys Pro Thr Pro lie Gin Leu Asn 
1 5 10 



<210> 103 

<211> 45 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Anthrax 
protease substrate recognition sequence 



101 



BNSDOCID: «:WO 0O50a72A2j_> 



wo 00/50872 



PCT/USOO/04794 



<400> 103 

atgctggccc ggaggaagcc ggtgctgccg gcgctcacca tcaac 45 

<210> 104 
<21X> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Anthrax 
protease substrate recognition sequence 

<400s^ 104 

Met Leu Ala Arg Arg Lys Pro Val Leu Pro Ala Leu Thr lie Asn 
15 10 15 



<210> 105 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

tetanus /botulium substrate recognition sequence 

<400> 105 

gcctcgcagt ttgaaaca 18 



<210> 106 
<211> 6 

<212> PRT / 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: 

tetanus/botulium substrate recognition sequence 

<400> 106 

Ala Ser Gin Phe Glu Thr 
1 5 



<210> 107 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

tetanus/botulium substrate recognition sequence 

<400> 107 

gcttctcaat ttgaaacg IB 
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<210> 108 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

tetanus /botulium substrate recognition sequence 

<400> 108 

Ala Ser Gin Phe Glu Thr 
1 5 



<210> 109 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin A substrate recognition sequence 

<400> 109 

gccaaccaac gtgcaaca 



<210> 110 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin A substrate recognition sequence 

<400> 110 

Ala Asn Gin Arg Ala Thr 
1 5 



<210> 111 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin B substrate recognition sequence 

<400> 111 

gcttctcaat ttgaaacg 



<210> 112 
<211> 6 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin B substrate recognition sequence 

<400> 112 

Ala Ser Gin Phe Glu Thr 
1 5 



<210> 113 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin C substrate recognition sequence 

<400> 113 

acgaaaaaag ctgtgaaa 



<210> 114 

<:211> 6 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin C substrate recognition secjuence 

<400> 114 

Thr Lys Lys Ala Val Lys 
1 5 



<210> 115 

<211> 18 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin D substrate recognition sequence 

<400> 115 

gaccagaagc tctctgag 



<210> 116 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin D substrate recognition sequence 

<400> 116 

Asp Gin Lys Leu Ser Glu 
1 5 



<210>- 117 
<211> IB 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin E substrate recognition sequence 

<400> 117 

atcgacagga tcatggag 



<210> 118 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin E substrate recognition sequence 

<400> 118 

lie Asp Arg lie Met Glu 
1 5 



<210> 119 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin F substrate recognition sequence 

<400> 119 

agagaccaga agctctct 



<210> 120 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secjuence : Botulinum 
neurotoxin F substrate recognition secfuence 



105 



BNSDOCID: <WO O05Oe72A2j_> 



wo 00/50872 



PCT/USOO/04794 



<400> 120 

Arg Asp Gin Lys Leu Ser 
1 5 



<210> 121 

<211> 18 

<212> DNA 

<213> Artificial Sec[uence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin G substrate recognition sequence 

<400> 121 

acgagcgcag ccaagttg -^^ 



<210> 122 
<211> 6 
<212> PRT 

<213> Artificial Sequence 

<220> • 
<223> Description of Artificial Sequence: Botulinum 
neurotoxin G substrate recognition sequence 

<400> 122 

Thr Ser Ala Ala Lys Leu 
1 5 



<210> 123 / 

<211> 69 ^ 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

Cytoplasm/ cytoskeleton target sequence 

<400> 123 

atgtctactg tccacgaaat cctgtgcaag ctcagcttgg agggtgttca ttctacaccc 60 
ccaagtgcc 



<210> 124 

<211> 23 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

Cytoplasm/ cytoskeleton target sequence 
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<400> 124 

Met Ser Thr Val His Glu lie Leu Cys Lys Leu Ser Leu Glu Gly Val 
1 5 ' 10 15 

His Ser Thr Pro Pro Ser Ala 
20 



<210> 125 
<211> 96 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Inner surface 
of plasma membrane target sequence 

<400> 125 

atgggatgta cattaagcgc agaagacaaa gcagcagtag aaagaagcaa aatgatagac 60 
agaaacttaa gagaagacgg agaaaaagct gctaga 56 



<210> 126 
<211> 32 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
of plasma membrane target 

<400> 126 

Met Gly Cys Tbr Leu Ser Ala Glu 
1 5 

Lys Met He Asp Arg Asn Leu Arg 
20 



Sequence: Inner surface 
sequence 

Asp Lys Ala Ala Val Glu Arg Ser 
10 ' /15 

Glu Asp Gly Glu Lys Ala Ala Arg 
25 30 



<210> 127 
<211> IB 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nucleus target 
sequence 

<400> 127 

agaaggaaac gacaaaag 



<210> 128 
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<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nucleus target 
sequence 

<400> 128 

Arg Arg Lys Arg Gin Lys 
1 5 



<210> 129 
<211> 90 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nucleolus 
target sequence 

<400> 129 

agaaaacgta tacgtactta cctcaagtcc tgcaggcgga tgaaaagaag tggttttgag 60 
atgtctcgac ctattccttc ccaccttact 90 



<210> 130 
<2ai> 30 
<212> PRT 

<213> Artificial Sequence 

<220> ^ 
<223> Description of Artificial Sequence: Nucleolus 
target sequence 

<400> 130 

Arg Lys Arg lie Arg Thr Tyr Leu Lys Ser Cys Arg Arg Met Lys Arg 
15 10 IS 

Ser Gly Phe Glu Met Ser Arg Pro lie Pro Ser His Leu Thr 
20 25 30 



<210> 131 
<211> 87 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secjuence: Mitochondria 
target sequence 

<400> 131 

atgtccgtcc tgacgccgct gctgctgcgg ggcttgacag gctcggcccg gcggctccca 60 
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<210> 132 
<211> 29 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Mitochondria 
target sequence 

<400> 132 

Met Ser Val Leu Thr Pro Leu Leu Leu Arg Gly Leu Thr Gly Ser Ala 
15 10 15 

Arg Arg Leu Pro Val Pro Arg Ala Leu lie His Ser Leu 
20 25 



<210> 133 

<211> 99 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear 
Envelope target sequence 



<400> 133 

atgagcattg ttttaataat tgttattgtg gtgatttttt taatatgttt tttatattta 60 



agcaacagca aagatcccag agtaccagtt gaattaatg 



99 



<210> 134 
<211> 33 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Nuclear 
Envelope target sequence 

<400> 134 

Met Ser lie Val Leu He He Val He Val Val He Phe Leu He Cys 
1 5 10 15 

Phe Leu Tyr Leu Ser Asn Ser Lys Asp Pro Arg Val Pro Val Glu Leu 
20 25 30 

Met 



<210> 135 
<211> 246 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Golgi target 
sequence 

<400> 135 

atgaggcttc gggagccgct cctgagcggc agcgccgcga tgccaggcgc gtccctacag 60 

cgggcctgcc gcctgctcgt ggccgtctgc gctctgcacc ttggcgtcac cctcgtttac 120 

tacctggctg gccgcgacct gagccgcctg ccccaactgg tcggagtctc cacaccgctg 180 

cagggcggct cgaacagtgc cgccgccatc gggcagtcct ccggggagct ccggaccgga 240 



ggggCC 



<210> 136 
<211> 82 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Golgi target 
sequence 

<400> 136 

Met Arg Leu Arg Glu Pro Leu Leu Ser Gly Ser Ala Ala Met Pro Gly 
1 5 10 15 

Ala Ser Leu Gin Arg Ala Cys Arg Leu Leu Val Ala Val Cys Ala Leu 
20 25 

His Leu Gly Val Tlir Leu Val Tyr Tyr Leu Ala Gly Arg Asp Leu Ser 
35 40 45 

Arg Leu Pro Gin Leu Val Gly Val Ser Thr Pro Leu Gin Gly Gly Ser 
50 55 60 

Asn Ser Ala Ala Ala He Gly Gin Ser Ser Gly Glu Leu Arg Thr Gly 
65 70 75 80 

Gly Ala 



246 



<210> 137 
<211> 150 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Endoplasmic 
reticulum target sequence 
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<400> 137 

gaaacaataa gacctataag 
atggcaattc aattaagatc 
ggatggtggt ggtttttcag 



aataagaaga tgttcttatt 
tccctttcca ttagcattac 
tagaaaaaaa 



ttacatctac agacagcaaa 60 
caggaatgtt agctttatta 12 0 

150 



<210> 138 
<211> 50 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Endoplasmic 
reticulum target sequence 

<400> 138 

Glu Thr lie Arg Pro lie Arg He Arg Arg Cys Ser Tyr Phe Thr Ser 
1 5 10 15 

Thr Asp Ser Lys Met Ala He Gin Leu Arg Ser Pro Phe Pro Leu Ala 
20 25 30 

Leu Pro Gly Met Leu Ala Leu Leu Gly Trp Trp Trp Phe Phe Ser Arg 
35 40 45 

Lys Lys 
50 



<210> 139 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear Export 
target sequence 

<400> 139 

gccttgcaga agaagctgga ggagctagag cttgatgag 



<210> 140 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear Export 
target sequence 

<400> 140 

Ala Leu Gin Lys Lys Leu Glu Glu Leu Glu Leu Asp Glu 
1 5 10 . 
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<210> 141 
<211> 1024 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Size exclusion 
target sequence 



<400> 141 
gccgacctca 


gtcttgtgga 


tgcgttgaca 


gaaccacctc 


cagaaattga 


gggagaaata 


60 


aagcgagact 


tcatggctgc 


gctggaggca 


gagccctatg atgacatcgt gggagaaact 


120 


gtggagaaaa 


ctgagtttat 


tcctctcctg 


gatggtgatg 


agaaaaccgg 


gaactcagag 


180 


tccaaaaaga 


aaccctgctt 


agacactagc 


caggttgaag gtatcccatc 


ttctaaacca 


240 


acactcctag 


ccaatggtga 


tcatggaatg 






gtctccaact 


300 


gacttccttg 


aagagagagt 


ggactatccg 


gattatcaga 


gcagccagaa 


ctggccagaa 


360 


gatgcaagct 


tttgtttcca 


gcctcagcaa 


gtgttagata 


ctgaccaggc 


tcraacccttt 

w *H ^* z3 ^ 


420 


aacgagcacc 


gtgatgatgg 


tttggcagat 


ctgctctttg 


tctccagtgg 


acccacgaac 


480 


gcttctgcat 


ttacagagcg 


agacaatcct 


tcagaagaca 


gttacggtat 


gcttccctgt 


540 


gactcatttg 


cttccacggc 


tgttgtatct 


caggagtggt 


ctgtgggagc 


cccaaactct 


600 


ccatgttcag 


agtcctgtgt 


ctccccagag 


gttactatag 


aaaccctaca 


gccagcaaca 


660 


gagctctcca 


aggcagcaga 


agtggaatca 


gtgaaagagc 


agctgccagc 


taaagcat^ig 


720 


gaaacgatgg 


cagagcagac 


cactgatgtg 


gtgcactctc 


catccacaga 


cacaacacca 


7 80 


ggcccagaca 


cagaggcagc 


actggctaaa 


gacatagaag 


agatcaccaa 


gccagatgtg 


840 


atattggcaa 


atgtcacgca 


gccatctact 


gaatcggata 


tgttcctggc 


ccaggacatg 


900 


gaactactca 


caggaacaga 


ggcagcccac 


gctaacaata 


tcatattgcc 


tacagaacca 


960 


gacgaatctt 


caaccaagga 


tgtagcacca 


cctatggaag 


aagaaattgt 


cccaggcaat 


1020 


gata 












1024 



<210> 142 
<211> 566 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Size exclusion 
target sequence 
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<400> 142 

Ala Asp Leu Ser Leu Val Asp Ala Leu Thr Glu Pro Pro Pro Glu He 
15 10 15 

Glu Gly Glu He Lys Arg Asp Phe Met Ala Ala Leu Glu Ala Glu Pro 
20 25 30 

Tyr Asp Asp He Val Gly Glu Thr Val Glu Lys Thr Glu Phe He Pro 
35 40 45 

Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys Lys 
50 55 60 

Pro Cys Leu Asp Thr Ser Gin Val Glu Gly He Pro Ser Ser Lys Pro 
65 70 75 80 

Thr Leu Leu Ala Asn Gly Asp His Gly Met Glu Gly Asn Asn Thr Ala 
85 90 95 

Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp Tyr 
100 105 110 

Gin Ser Ser Gin Asn Trp Pro Glu Asp Ala Ser Phe Cys Phe Gin , Pro 
115 120 125 

Gin Gin Val Leu Asp Thr Asp Gin Ala Glu Pro Phe Asn Glu His Arg 
130 135 140 

Asp Asp Gly Leu Ala Asp Leu Leu Phe Val Ser Ser Gly Pro Thr Asn 
145 150 155 160 

Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr Gly 
165 170 175 

Met Leu Pro Cys Asp Ser Phe Ala Ser Thr Ala Val Val Ser Gin Glu 
180 185 190 

Trp Ser Val Gly Ala Pro Asn Ser Pro Cys Ser Glu Ser Cys Val Ser 
195 200 205 

Pro Glu Val Thr He Glu Thr Leu Gin Pro Ala Thr Glu Leu Ser Lys 
210 215 220 

Ala Ala Glu Val Glu Ser Val Lys Glu Gin Leu Pro Ala Lys Ala Leu 
225 230 235 240 

Glu Thr Met Ala Glu Gin Thr Thr Asp Val Val His Ser Pro Ser Thr 
245 250 255 

Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp He 
260 265 270 

Glu Glu He Thr Lys Pro Asp Val He Leu Ala Asn Val Thr Gin Pro 
275 280 285 

Ser Thr Glu Ser Asp Met Phe Leu Ala Gin Asp Met Glu Leu Leu Thr 
290 295 300 
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Gly Thr Glu Ala Ala Hie Ala Asn Asn lie He Leu Pro Thr Glu Pro 
305 310 315 320 

Asp Glu Ser Ser Thr Lys Asp Val Ala Pro Pro Met Glu Glu Glu He 
325 330 335 

Val Pro Gly Asn Asp Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr Leu 
340 345 350 

Pro lie Lys Met Asp Leu Ala Pro Pro Glu Asp Val Leu Leu Thr Lys 
355 360 , 365 

Glu Thr Glu Leu Ala Pro Ala Lys Gly Met Val Ser Leu Ser Glu He 
370 375 380 

Glu Glu Ala Leu Ala Lys Asn Asp Val Arg Ser Ala Glu He Pro Val 
385 390 395 400 

Ala Gin Glu Thr Val Val Ser Glu Thr Glu Val Val Leu Ala Thr Glu 
405 410 415 

Val Val Leu Pro Ser Asp Pro He Thr Thr Leu Thr Lys Asp Val Thr 
420 425 430 

Leu Pro Leu Glu Ala Glu Arg Pro Leu Val Thr Asp Met Thr Pro Ser 
435 - 440 445 

Leu Glu Thr Glu Met Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr Glu 
450 455 . 460 

Thr Asn Leu Gly Met Ala Lys Asp Met Ser Pro Leu Pro Glu Ser Glu 
465 470 475 480 

Val Thr Leu Gly Lys Asp Val Val He Leu Pro Glu Thr Lys Val Ala 
485 490 . 495 

Glu Phe Asn Asn Val Thr Pro Leu Ser Glu Glu Glu Val Thr Ser Val 
500 505 510 

Lys Asp Met Ser Pro Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys Asn 
515 520 525 

Ala Asp Leu His Ser Gly Thr Glu Leu He Val Asp Asn Ser Met Ala 
530 535 540 

Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu Thr Lys Val Ala Thr Val 
545 550 555 560 

Pro He Lys Asp Lys Gly 
565 



<210> 143 
<211> 63 
<212> DNA 
<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: Vesicle 
membrane target sequence 

<400> 143 

atgtgggcaa tcgggattac tgttctggtt atcttcatca tcatcatcat cgtgtgggtt 60 
gtc 63 



<210> 144 

<211> 21 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Vesicle 
membrane target sequence 

<400> 144 

Met Trp Ala He Gly He Thr Val Leu Val He Phe He He He He 
1 5 10 15 

He Val Trp Val Val 
20 



<210> 145 
<211> 61 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Vesicle 
membrane* target sequence 

<400> 145 

atgtgggcga tagggatcag tgtcctggtg atcattgtca tcatcatcat cgtgtggtgt 60 



<210> 146 
<211> 20 
<212> PRT 

<213> Artificial Secjuence 
<220> 

<223> Description of Artificial Sequence: Vesicle 
membrane target sequence 

<400> 146 

Met Trp Ala He Gly He Ser Val Leu Val He He Val He He He 
1 5 10 15 

He Val Trp Cya 
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20 



<210> 147 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear Export 
target secpience 

<400> 147 

gacctgcaga agaagctgga ggagctggaa cttgacgag 



<210> 148 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear Export 
target sequence 

<400> 148 . 

Asp Leu Gin Lys Lys Leu Glu Glu Leu Glu Leu Asp Glu 
1 5 10 



<210> 149 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peroxisome 
target sequence 

<400> 149 
tctaaactg 



<210> 150 
<211> 3 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peroxisome 
target sequence 

<400> 150 
Ser Lys Leu 
1 



116 



wo 00/50872 



PCT/USOO/04794 



48 



96 



144 



<210> 151 
<211> 3378 
<212> DNA 

<213> MuB musculus 

<220> 
<221> CDS 
<222> (1) . . (3375) 

<400> 151 

atg gcc gac etc agt ctt gtg gat gcg ttg aca gaa cca cct cca gaa 
Met Ala Asp Leu Ser Leu Val Asp Ala Leu Thr Glu Pro Pro Pro Glu 
15 10 15 

att gag gga gaa ata aag cga gac ttc atg get gcg ctg gag gca gag 
He Glu Gly Glu He Lys Arg Asp Phe Met Ala Ala Leu Glu Ala Glu 
20 25 30 

ccc tat gat gac ate gtg gga gaa act gtg gag aaa act gag ttt att 
Pro Tvr ASP Asp He Val Gly Glu Thr Val Glu Lys Thr Glu Phe He 
35 40 45 

cct etc ctg gat ggt gat gag aaa acc ggg aac tea gag tec aaa aag 192 
Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys 
50 55 60 

aaa ccc tgc tta gac act age cag gtt gaa ggt ate cca tct tct aaa 240 
Lvs Pro Cys Leu Asp Thr Ser Gin Val Glu Gly He Pro Ser Ser Lys 
65 70 75 80 

cca aca etc eta gcc aat ggt gat cat gga atg gag ggg aat aac act 
Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met Glu Gly Asn Asn Thr 
85 90 95 

gca ggg tct cca act gac ttc ctt gaa gag aga gtg gac tat ccg gat 
Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp 
100 105 110 

tat cag age age cag aac tgg cca gaa gat gca age ttt tgt ttc cag 
Tyr Gin Ser Ser Gin Asn Trp Pro Glu Asp Ala Ser Phe Cys Phe Gin 
115 120 125 

cct cag caa gtg tta gat act gac cag get gag ccc ttt aac gag cac 
Pro Gin Gin Val Leu Asp Thr Asp Gin Ala Glu Pro Phe Asn Glu Hxs 
130 135 140 

cgt gat gat ggt ttg gca gat ctg etc ttt gtc tec agt gga ccc acg 
Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val Ser Ser Gly Pro Thr 
145 ISO 155 160 

aac get tct gca ttt aca gag cga gac aat cct tea gaa gac agt tac 
Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr 
165 170 175 

ggt atg ctt ccc tgt gac tea ttt get tec acg get gtt gta tct cag 576 
Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr Ala Val Val Ser Gin 
180 185 190 
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gag tgg tct gtg gga gcc cca aac tct cca tgt tea gag tec tgt gtc 624 
Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys Ser Glu Ser Cys Val 
195 200 205 

tec cca gag gtt act ata gaa acc eta cag cca gca aca gag etc tec 672 
Ser Pro Glu Val Thr lie Glu Thr Leu Gin Pro Ala Thr Glu Leu Ser 
210 215 220 



aag gca gca gaa gtg gaa tea gtg aaa gag cag ctg cca get aaa gca 
Lys Ala Ala Glu Val Glu Ser Val Lys Glu Glxi Leu Pro Ala Lys Ala 
225 230 235 240 

ttg gaa acg atg gca gag cag acc act gat gtg gtg cac tct cca tec 
Leu Glu Thr Met Ala Glu Gin Thr Thr Asp Val Val His Ser Pro Ser 
245 250 255 

aca gac aca aca cca ggc cca gac aca gag gca gca ctg get aaa gac 
Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp 
260 265 270 

ata gaa gag ate ace aag cca gat gtg ata ttg gca aat gtc acg cag 
lie Glu Glu He Thr Lys Pro Asp Val He Leu Ala Asn Val Thr Gin 
275 280 285 



720 



768 



616 



864 



1008 



1056 



cca tct act gaa teg gat atg ttc ctg gee cag gac atg "gaa eta etc 9X2 
Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gin Asp Met Glu Leu Leu 
290 295 300 

aca gga aca gag gca gee cac get aac aat ate ata ttg cet aca gaa 960 
Thr Gly Thr Glu Ala Ala His Ala Asn Asn He He Leu Pro Thr Glu 
305 310 315 320 

cca gac gaa tct tea acc aag gat gta gca cca cet atg gaa gaa gaa 
Pro Asp Glu Ser Ser Thr Lys Asp Val'^Ala Pro Pro Met Glu Glu Glu 
325 330 . 335 

att gtc cca ggc aat gat acg aca tec cec aaa gaa aca gag aca aca 
He Val Pro Gly Asn Asp Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr 
340 345 350 

ctt cca ata aaa atg gac ttg gca cca cet gag gat gtg tta ctt acc 1104 
Leu Pro He Lys Met Asp Leu Ala Pro Pro Glu Asp Val Leu Leu Thr 
355 360 365 

aaa gaa aca gaa eta gee cca gcc aag ggc atg gtt tea etc tea gaa 1152 
Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met Val Ser Leu Ser Glu 
370 375 380 

ata gaa gag get ctg gca aag aat gat gtt egc tct gca gaa ata cet 1200 
He Glu Glu Ala Leu Ala Lys Asn Asp Val Arg Ser Ala Glu He Pro 
385 390 395 400 

gtg get cag gag aca gtg gtc tea gaa aca gag gtg gtc ctg gca aca 1248 
Val Ala Gin Glu Thr Val Val Ser Glu Thr Glu Val Val Leu Ala Thr 
405 410 415 
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gaa gtg gta ctg ccc tea gat ccc ata aca aca ttg aca aag gat gtg 1296 
Glu Val Val Leu Pro Ser Asp Pro lie Thr Thr Leu Thr Lys Asp Val 
420 425 430 



aca etc ccc tta gaa gca gag aga ccg ttg gtg acg gac atg act cca 
Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val Thr Asp Met Thr Pro 
435 440 445 



gaa gac tec cag tta gca tct atg cag cac aag gga cag tea aca gta 
Glu Asp Ser Gin Leu Ala Ser Met Gin His Lys Gly Gin Ser Thr Val 
580 585 590 



tct acc tta cca ata gat gca cet tct cca tta gag aac tta gag cag 
Ser Thr Leu Pro lie Asp Ala Pro Ser Pro Leu Glu Asn Leu Glu Gin 
610 615 620 



1344 



tct ctg gaa aca gaa atg acc eta ggc aaa gag aca get cca ccc aca 13 92 
Ser Leu Glu Thr Glu Met Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr 
450 455 460 

gaa aca aat ttg ggc atg gcc aaa gac atg tct cca etc cca gaa tea 1440 
Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser Pro Leu Pro Glu Ser 
465 470 475 480 

gaa gtg act ctg ggc aag gac gtg gtt ata ett cca gaa aca aag gtg 14 88 
Glu val Thr Leu Gly Lys Asp Val Val lie Leu Pro Glu Thr Lys Val 
485 490 495 

get gag ttt aac aat gtg act cca ett tea gaa gaa gag gta acc tea 1536 
Ala Glu Phe Asn Asn Val Thr Pro Leu Ser Glu Glu Glu Val Thr Ser 
500 505 510 

gtc aag gac atg tct ccg tct gca gaa aca gag get ccc ctg get aag 1584 
Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys 
515 " 520 525 

aat get gat ctg cac tea gga aca gag ctg att gtg gac aac age atg 1632 
Asn Ala Asp Leu His Ser Gly Thr Glu Leu lie Val Asp Asn Ser Met 
530 535 540 

get cca gcc tec gat ett gca ctg ccc ttg gaa aca aaa gta gca aca 1680 
Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu Thr Lys Val Ala Thr 
545 550 555 560 

gtt cca att aaa gac aaa gga act gta cag act gaa gaa aaa cca cgt 1728 
Val Pro lie Lys Asp Lys Gly Thr Val Gin Thr Glu Glu Lys Pro Arg 
565 570 575 



1776 



cet cet tgc acg get tea cca gaa cca gtc aaa get gca gaa caa atg 1824 
Pro Pro Cys Thr Ala Ser Pro Glu Pro Val Lys Ala Ala Glu Gin Met 
595 600 605 



1872 



aag gaa acg cet ggc age cag cet tct gag cet tgc tea gga gta tec , 1920 

Lys Glu Thr Pro Gly Ser Gla Pro Ser Glu Pro Cys Ser Gly Val Ser 

625 630 635 640 

egg caa gaa gaa gca aag get get gta ggt gtg act gga aat gac ate 1968 
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Arg Gin Glu Glu Ala Lys Ala Ala Val Gly Val Thr Gly Asn Asp Xle 
645 650 655 

act acc ccg cca aac aag gag cca cca cca age cca gaa aag aaa gca 2016 

Thr Thr Pro Pro Asn Lys Glu Pro Pro Pro Ser Pro Glu Lys Lys Ala 
660 665 670 

aag cct ttg gcc acc act caa cct gca aag act tea aca teg aaa gcc 2 064 

Lys Pro Leu Ala Thr Thr Gin Pro Ala Lys Thr Ser Thr Ser Lys Ala 
675 680 685 

aaa aca cag ccc act tct etc cct aag caa cca get ccc acc acc tct 2112 

Lys Thr Gin Pro Thr Ser Leu Pro Lys Gin Pro Ala Pro Thr Thr Ser 
690 695 700 

ggt ggg ttg aat aaa aaa ccc atg age etc gcc tea ggc tea gtg cca 2160 

Gly Gly Leu Asn Lys Lys Pro Met Ser Leu Ala Ser Gly Ser Val Pro 

705 710 715 720 

get gee cca cac aaa cgc cct get get gcc act get act gcc agg cct 2208 

Ala Ala Pro His Lys Arg Pro Ala Ala Ala Thr Ala Thr Ala Arg Pro 
725 730 735 

tec acc eta cct gee aga. gac gtg aag cca aag cca att aca gaa get 2256 

Ser Thr Leu Pro Ala Arg Asp Val Lys Pro Lys Pro lie Thr Glu Ala 
740 - , 745 750 

aag gtt gee gaa aag egg acc tct cca tec aag cct tea tct gcc cca 23 04 

Lys Val Ala Glu Lys Arg Thr Ser Pro Ser Lys Pro Ser Ser Ala Pro 
755 760 765 

gcc etc aaa cct gga cct aaa acc acc cca acc gtt tea aaa gee aca 2352 

Ala Leu Lys Pro Gly Pro Lys Thr Thr Pro Thr Val Ser Lys Ala Thr 
770 . 775 780 

tct ccc tea act ett gtt tec act gga cca .agt agt aga agt cca get 2400 

Ser Pro Ser Thr Leu Val Ser Thr Gly Pro Ser Ser Arg Ser Pro Ala 

785 790 795 800 

aca act ctg cct aag agg cca acc age ate aag act gag ggg aaa cct 2448 

Thr Thr Leu Pro Lys Arg Pro Thr Ser lie Lys Thr Glu Gly Lys Pro 
805 810 815 

get gat gtc aaa agg atg act get aag tct gcc tea get gac ttg agt 2496 

Ala Asp Val Lys Arg Met Thr Ala Lys Ser Ala Ser Ala Asp Leu Ser 
820 825 830 

cgc tea aag acc ace tct gcc agt tct gtg aag aga aac acc act ccc 2544 

Arg Ser Lys Thr Thr Ser Ala Ser Ser Val Lys Arg Asn Thr Thr Pro 
635 840 845 

act ggg gca gca ccc cca gca ggg atg act tec act cga gtc aag ccc 25 92 

Thr Gly Ala Ala Pro Pro Ala Gly Met Thr Ser Thr Arg Val Lys Pro 
850 855 860 

atg tct gca cct age cgc tct tct ggg get ett tct gtg gac aag aag 2640 

Met Ser Ala Pro Ser Arg Ser Ser Gly Ala Leu Ser Val Asp Lye Lys 
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865 870 875 880 

ccc act tec act aag cot age tec tct get ecc agg gtg age cgc ctg 

Pro Thr Ser Thr Lys Pro Ser Ser Ser Ala Pro Arg Val Ser Arg Leu 

885 890 895 



ggc tct aca gaa aac ate aaa cac cag cct gga gga gge egg gee aaa 
Gly Ser Thr Glu Asn lie Dys His Gin Pro Gly Gly Gly Arg Ala Lys 
915 920 925 



tgt ggc aat gtt cag att cag aac aag aaa gtg gac ata tec aag gtc 

Cys Gly Asn Val Gin lie Gin Asn Lys Lys Val Asp lie Ser Lys Val 

995 1000 1005 

tec tee aag tgt ggg tec aaa get aat ate aag cac aag cct ggt gga 

Ser Ser Lys Cys Gly Ser Lys Ala Asn lie .Lys His Lys Pro Gly Gly 

1010 lOlS 1020 



2688 



gee aca act gtt tct gee cct gac ctg aag agt gtt cgc tec aag gtc 2736 
Ala Thr Thr Val Ser Ala Pro Asp Leu Lys Ser Val Arg Ser Lys Val 
900 905 910 



2784 



gta gag aaa aaa aca gag gca get acc aca get ggg aag cct gaa cct 2832 
Val Glu Lys Lys Thr Glu Ala Ala Thr Thr Ala Gly Lys Pro Glu Pro 

930 935 940 

aat gca gtc act aaa gca gee ggc tec att gcg agt gca cag aaa ccg 2880 

Asn Ala Val Thr Lys Ala Ala Gly Ser He Ala Ser Ala Gin Lys Pro 
945 950 955 960 

cct get ggg aaa gtc cag ata gta tec aaa aaa gtg age tac agt cat 

Pro Ala Gly Lys Val Gin He Val Ser Lys Lys Val Ser Tyr Ser His 
965 970 975 

att caa tec aag tgt gtt tec aag gac aat att aag cat gtc cct gga 2976 

He Gin Ser Lys Cys Val Ser Lys Asp Asn He Lys His Val Pro Gly 
980 985 990 



2928 



3024 



3072 



gga gat gtc aag att gaa agt cag aag ttg aac ttc aag gag aag gee 3120 
Gly Asp Val Lys He Glu Ser Gin Lys Leu Asn Phe Lys Glu Lys Ala 
1025 1030 1035 1040 

caa gee aaa gtg gga tec ctt gat aac gtt ggc cac ttt cct gca gga 3168 
Gin Ala Lys Val Gly Ser Leu Asp Asn Val Gly His Phe Pro Ala Gly 
1045 1050 1055 

ggt gee gtg aag act gag ggc ggt ggc agt gag gee ctt ccg tgt cca 3216 
Gly Ala Val Lys Thr Glu Gly Gly Gly Ser Glu Ala Leu Pro Cys Pro 
1060 1065 1070 

gge ccc ccc get ggg gag gag cca gtc ate cct gag get gcg cct gac 3264 
Gly Pro Pro Ala Gly Glu Glu Pro Val He Pro Glu Ala Ala Pro Asp 
1075 1080 1085 

cgt ggc gcc cct act tea gee agt ggc etc agt ggc cac acc acc ctg 3312 
Arg Gly Ala Pro Thr Ser Ala Ser Gly Leu Ser Gly His Thr Thr Leu 
1090 1095 1100 
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tea ggg ggt ggt gac caa agg gag ccc cag acc ttg gac age cag ate 3360 
Ser Gly Gly Gly Asp Gin Arg Glu Pro Gin Thr Leu Asp Ser Gin He 
1105 1110 1115 



cag gag aca age ate taa 
Gin Glu Thr Ser lie 
1125 



<210> 152 

<211> 1125 

<212> PRT 

<213> Mus museulus 

<400> 152 

Met Ala Asp Leu Ser Leu Val Asp Ala Leu Thr Glu Pro Pro Pro Glu 
1 5 10 15 

He Glu Gly Glu He Lys Arg Asp Phe Met Ala Ala Leu Glu Ala Glu 
20 25 30 

Pro Tyr Asp Asp He Val Gly Glu Thr Val Glu Lys Thr Glu Phe He 
3 5 40 45 

Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys 
50 55 60 

Lvs Pro Cys Leu Asp Thr Ser Gin Val Glu Gly He Pro Ser Ser Lys 
65 70 75 80 

Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met Glu Gly Asn Asn Thr 
85 90 95 

Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp 
100 105 110 

Tyr Gin Ser Ser Gin Asn Trp Pro Glu Asp Ala Ser Phe Cys Phe Gin 
115 120 125 

Pro Gin Gin Val Leu Asp Thr Asp Gin Ala Glu Pro Phe Asn Glu His 
130 135 140 

Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val Ser Ser Gly Pro Thr 
145 150 155 160 

Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr 
165 170 175 

Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr Ala Val Val Ser Gin 
180 185 190 

Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys Ser Glu. Ser Cys Val 
195 200 205 

Ser Pro Glu Val Thr He Glu Thr Leu Gin Pro Ala Thr Glu Leu Ser 
210 215 220 
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Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gin Leu Pro Ala Lys Ala 
225 230 235 240 

Leu Glu Thr Met Ala Glu Gin Thr Thr Asp Val Val His Ser Pro Ser 
245 250 255 

Thr Asp Thx Thr Pro Gly Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp 
260 265 270 

He Glu Glu He Thr Lys Pro Asp Val He Leu Ala Asn Val Thr Gin 
275 280 285 

Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gin Asp Met Glu Leu Leu 
290 295 300 

Thr Gly Thr Glu Ala Ala His Ala Asn Asn He He Leu Pro Thr Glu 
305 310 315 320 

Pro Asp Glu Ser Ser Thr Lys Asp Val Ala Pro Pro Met Glu Glu Glu 
325 330 335 

He Val Pro Gly Asn Asp Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr 
340 345 350 

Leu Pro He Lys Met Asp Leu Ala Pro Pro Glu Asp Val Leu Leu Thr 
355 360 ' 365 

Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met Val Ser Leu Ser Glu 
370 375 380 

He Glu Glu Ala Leu Ala Lys Asn Asp Val Arg Ser Ala Glu He Pro 
385 390 395 400 

Val Ala Gin Glu Thr Val Val Ser Glu Thr Glu Val Val Leu Ala Thr 
405 410 . 415 

Glu Val Val Leu Pro Ser Asp Pro He Thr Thr Leu Thr Lys Asp Val 
420 425 430 

Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val Thr Asp Met Thr Pro 
435 440 445 

Ser Leu Glu Thr Glu Met Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr 
450 455 460 

Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser Pro Leu Pro Glu Ser 
465 470 475 480 

Glu Val Thr Leu Gly Lys Asp Val Val He Leu Pro Glu Thr Lys Val 
485 490 495 

Ala Glu Phe Asn Asn Val Thr Pro Leu Ser Glu Glu Glu Val Thr Ser 
500 505 510 

Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys 
515 520 525 
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Asn Ala Asp Leu His Ser Gly Thr Glu lieu lie Val Asp Asn Ser Met 
530 535 540 

Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu Thr Lys Val Ala Thr 
545 550 555 560 

Val Pro lie Lys Asp Lys Gly Thr Val Gin Thr Glu Glu Lys Pro Arg 
565 570 575 

Glu Asp Ser Gin Leu Ala Ser Met Gin His Lys Gly Gin Ser Thr Val 
580 585 590 

Pro Pro Cys Thr Ala Ser Pro Glu Pro Val Lys Ala Ala Glu Gin Met 
595 600 605 

Ser Thr Leu Pro lie Asp Ala Pro Ser Pro Leu Glu Asn Leu Glu Gin 
610 615 620 

Lys Glu Thr Pro Gly Ser Gin Pro Ser Glu Pro Cys Ser Gly Val Ser 
625 630 635 640 

Arg Gin Glu Glu Ala Lys Ala Ala Val Gly Val Thr Gly Asn Asp He 
645 650 655 

Thr Thr Pro Pro Asn Lys Glu Pro Pro Pro Ser Pro Glu Lys Lys' Ala 
660 665 670 

Lys Pro Leu Ala Thr Thr Gin Pro Ala Lys Thr Ser Thr Ser Lys Ala 
675 680 685 

Lys Thr Gin Pro Thr Ser Leu Pro Lys Gin Pro Ala Pro Thr Thr Ser 
690 695 700 

Gly Gly Leu Asn Lys Lys Pro M^t Ser Leu Ala Ser Gly Ser Val Pro 
705 710 ,715 720 

Ala Ala Pro His Lys Arg Pro Ala Ala Ala Thr Ala Thr Ala Arg Pro 
725 730 735 

Ser Thr Leu Pro Ala Arg Asp Val Lys Pro Lys Pro lie Thr Glu Ala 
740 745 750 

Lys Val Ala Glu Lys Arg Thr Ser Pro Ser Lys Pro Ser Ser Ala Pro 
755 760 765 

Ala Leu Lys Pro Gly Pro Lys Thr Thr Pro Thr Val Ser Lys Ala Thr 
770 775 780 

Ser Pro Ser Thr Leu Val Ser Thr Gly Pro Ser Ser Arg Ser Pro Ala 
785 . 790 795 800 

Thr Thr Leu Pro Lys Arg Pro Thr Ser He Lys Thr Glu Gly Lys Pro 
805 BIO 815 

Ala Asp Val Lys Arg Met Thr Ala Lys Ser Ala Ser Ala Asp Leu Ser 
820 825 830 
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Arg Ser Lys Thr Thr Ser Ala Ser Ser Val Lys Arg Asn Thr Thr Pro 
835 840 845 

Thr Gly Ala Ala Pro Pro Ala Gly Met Thr Ser Thr Arg Val Lys Pro 
850 855 860 

Met Ser Ala Pro Ser Arg Ser Ser Gly Ala Leu Ser Val Asp Lys Lys 
865 870 875 880 

Pro Thr Ser Thr Lys Pro Ser Ser Ser Ala Pro Arg Val Ser Arg Leu 
885 890 895 

Ala Thr Thr Val Ser Ala Pro Asp Leu Lys Ser Val Arg Ser Lys Val 
900 905 910 

Gly Ser Thr Glu Asn lie Lys His Gin Pro Gly Gly Gly Arg Ala Lys 
915 920 925 

Val Glu Lys Lys Thr Glu Ala Ala Thr Thr Ala Gly Lys Pro Glu Pro 
930 935 940 

Asn Ala Val Thr Lys Ala Ala Gly Ser lie Ala Ser Ala Gin Lys Pro 
945 950 955 960 

Pro Ala Gly Lys Val Gin lie Val Ser Lys Lys Val Ser Tyr Ser His 
965 970 ' 975 

lie Gin Ser Lys Cys Val Ser Lys Asp Asn He Lys His Val Pro Gly 
980 985 990 

Cys Gly Asn Val Gin He Gin Asn Lys Lys Val Asp He Ser Lys Val 
995 1000 1005 

Ser Ser Lys Cys Gly Ser Lys Ala Asn He Lys His Lys Pro Gly Gly 
1010 1015 ^ 1020 

Gly Asp Val Lys He Glu Ser Gin Lys Leu Asn Phe Lys Glu. Lys Ala 
1025 1030 1035 1040 

Gin Ala Lys Val Gly Ser Leu Asp Asn Val Gly His Phe Pro Ala Gly 
1045 1050 1055 

Gly Ala Val Lys Thr Glu Gly Gly Gly Ser Glu Ala Leu Pro Cys Pro 
1060 1065 1070 

Gly Pro Pro Ala Gly Glu Glu Pro Val He Pro Glu Ala Ala Pro Asp 
1075 1080 1085 

Arg Gly Ala Pro Thr Ser Ala Ser Gly Leu Ser Gly His Thr Thr Leu 
1090 1095 1100 

Ser Gly Gly Gly Asp Gin Arg Glu Pro Gin Thr Leu Asp Ser Gin He 
1105 1110 1115 1120 

Gin Glu Thr Ser He 
1125 
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<210> 153 
<211> 96 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 153 

tcatcatccg gagctggagc cggagctggc cgatcggctg ttaaatctga aggaaagaga 60 
aagtgtgacg aagttgatgg aattgatgaa gtagca 96 



<210> 154 
<211> 99 
<212> DNTA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> -154 

gaagaaggat ccggcacttg ggggtgtaga atgaacaccc tccaagctga gcttgcacag 60 
gatttcgtgg acagtagaca tagtacttgc tacttcatc 99 



<210> 155 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 155 

tcatcatccg gagctgga 18 



<210> 156 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secpience : 
oligonucleotide 

<400> 156 

gaagaaggat ccggcact IB 
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<210> 157 
<211> 96 
<212> DNA 

<213> Artificial Sequence 
<220> f 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 157 

tcatcatccg gaagaaggaa acgacaaaag cgatcggctg ttaaatctga aggaaagaga 60 
aagtgtgacg aagttgatgg aattgatgaa gtagca 96 



<210> 158 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<460> 158 

tcatcatccg gaagaagg 



<210> 159 
<211> 60 
<212> DNA 

<213> Artificial Secjuence 
<220> 

<223> Description of TUrtificial Sequence: 
oligonucleotide 

<400> 159 

tcatcatccg gaagaaggaa acgacaaaag cgatcgacaa gacttgttga aattgacaac 60 



<210> 160 
<211> 99 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 160 

gaagaaggat ccggcacttg ggggtgtaga atgaacaccc tccaagctga gcttgcacag 60 
gatttcgtgg acagtagaca tagtactgtt gtcaatttc 99 
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<210> 161 
<211> 84 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 161 

tcatcatccg gaagaaggaa acgacaaaag cgatcgtatc aaaaaggaat accagttgaa 60 
acagacagcg aagagcaacc ttat ®* 



<210> 162 
<211> 99 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 162 

gaagaaggat ccggcacttg ggggtgtaga atgaacaccc tccaagctga gcttgcacag 60 
gatttcgtgg acagtagaca tagtactata aggttgctc ^9 



<210> 163 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: 
oligonucleotide 

<400> 163 

tcatcatccg gaagaaaacg tatacgtact tacctcaagt cctgcaggcg gatgaaaaga 6 0 



<210> 164 
<211> 63 
<212> DNA 

<:213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 164 

gaagaacgat cgagtaaggt gggaaggaat aggtcgagac atctcaaaac cacttctttt 60 
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<210> 165 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 165 

tcatcatccg gaagaaaa 



<210> 166 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
o 1 i gonuc 1 eo t ide 

<400> 166 

gaagaacgat cgagtaag 



<210> 167 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspaee-1, 4 , 5 
substrate recocfnition secjuence 

<400> 167 
ttagaacatg acaa 



<210> 168 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-l , 4 , 5 
substrate recognition sequence 

<400> 168 
Leu Glu His Asp 
1 

<210> 169 
<211> 1380 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: GFP-HSP27 

<220> 

<221> CDS 

<222> (1) . . (1380) 

<400> 169 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gee acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr .Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Ctg acc tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 33 6 
Arg Thr lie Phe Phe Lys'^ Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 . 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gee gac aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gcc gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 
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ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gcc ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag tec 720 
Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

gga etc aga tet cga gcg gcg tec aga gca gag tea gee age atg acc 768 
Gly Leu Arg Ser Arg Ala Ala Ser Arg Ala Glu Ser Ala Ser Met Thr 
245 250 255 

gag cgc cgc gtc ccc ttc teg etc ctg egg ggc ccc age tgg gac ccc 816 
Glu Arg Arg Val Pro Phe Ser Leu Leu Arg Gly Pro Ser Trp Asp Pro 
260 265 270 

ttc cgc gae tgg tac ceg cat age cgc etc ttc gac cag gcc ttc ggg 864 
Phe Arg Asp Trp Tyr Pro His Ser Arg Leu Phe Asp Gin Ala Phe Gly 
275 280 285 

ctg ccc egg ctg ccg gag gag tgg teg cag tgg tta ggc ggc age age 912 
Leu Pro Arg Leu Pro Glu Glu Trp Ser Gin Trp Leu Gly Gly Ser Ser 
290 295 300 

tgg cca ggc tac gtg cgc ccc ctg ccc ccc gcc gcc ate gag age ccc 960 
Trp Pro Gly Tyr Val Arg Pro Leu Pro Pro Ala Ala lie Glu Ser Pro 
305 310 315 320 

gca gtg gcc gcg ccc gcc tac age cgc gcg etc age egg caa etc age 1008 
Ala Val Ala Ala Pro Ala Tyr Ser Arg Ala Leu Ser Arg Gin Leu Ser 
325 / 330 335 

age ggg gtc teg gag ate egg cac act gcg gac cgc tgg cgc gtg tec 1056 
Ser Gly Val Ser Glu lie Arg His Thr Ala Asp Arg Trp Arg Val Ser 
340 345 350 

ctg gat gtc aac cac ttc gcc ceg gae gag ctg acg gtc aag acc aag 1104 
Leu Asp Val Asn His Phe Ala Pro Asp Glu Leu Thr Val Lys Thr Lys 
355 360 365 

gat ggc gtg gtg gag ate ace ggc aag cac gag gag egg cag gac gag 1152 
Asp Gly Val Val Glu lie Thr Gly Lys His Glu Glu Arg Gin Asp Glu 
370 375 380 

cat ggc tac ate tec egg tgc ttc acg egg aaa tac acg ctg ccc ccc 1200 
His Gly Tyr lie Ser Arg Cys Phe Thr Arg Lys Tyr Thr Leu Pro Pro 
385 390 395 400 

ggt gtg gac ccc acc caa gtt tec tec tec ctg tec ect gag ggc aca 1248 
Gly Val Asp Pro Thr Gin Val Ser Ser Ser Leu Ser Pro Glu Gly Thr 
405 410 415 



ctg acc gtg gag gcc ccc atg ccc aag eta gcc acg cag tec aac gag 
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Leu Thr Val Glu Ala Pro Met Pro Lys Leu Ala Thr Gin Ser Asn Glu 
420 425 430 

ate acc ate cca gtc acc ttc gag teg egg gee cag ctt ggg ggc cea 1344 

lie Thr He Pro Val Thr Phe Glu Ser Arg Ala Gin Leu Gly Gly Pro 
435 440 445 



gaa get gca aaa tec gat gag act gcc gee aag taa 
Glu Ala Ala Lys Ser Asp Glu Thr Ala Ala Lys 

450 455 460 



<210> 170 
<211> 459 
<212> PRT 

<2i3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-HSP27 
<400> 170 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 €0 

Leu i:hr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 
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Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lye Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Ala Ser Arg Ala Glu Ser Ala Ser Met Thr 
245 250 255 

Glu Arg Arg Val Pro Phe Ser Leu Leu Arg Gly Pro Ser Trp Asp Pro 
260 265 270 

Phe Arg Asp Trp Tyr Pro His Ser Arg Leu Phe Asp Gin Ala Phe Gly 
275 280 285 

Leu Pro Arg Leu Pro Glu Glu Trp Ser Gin Trp Leu Gly Gly Ser Ser 
290 295 300 

Trp Pro Gly Tyr Val Arg Pro Leu Pro Pro Ala Ala lie Glu Ser Pro 
305 310 315 320 

Ala Val Ala Ala Pro Ala Tyr Ser Arg Ala Leu Ser Arg Gin Leu Ser 
325 330 335 

Ser Gly Val Ser Glu He Arg His Thr Ala Asp Arg Trp Arg Val Ser 
340 345 350 

Leu Asp Val Asn His Phe Ala Pro Asp Glu Leu Thr Val Lys Thr Lys 
355 360 365 

Asp Gly Val \Jal Glu He Thr Gly Lys His Glu Glu Arg Gin Asp Glu 
370 375 380 

His Gly Tyr He Ser Arg Cys Phe Thr Arg Lys Tyr Thr Leu Pro Pro 
385 390 395 400 

Gly Val Asp Pro Thr Gin Val Ser Ser Ser Leu Ser Pro Glu Gly Thr 
405 410 415 

Leu Thr Val Glu Ala Pro Met Pro Lys Leu Ala Thr Gin Ser Asn Glu 
420 425 430 

He Thr He Pro Val Thr Phe Glu Ser Arg Ala Gin Leu Gly Gly Pro 
435 440 445 

Glu Ala Ala Lys Ser Asp Glu Thr Ala Ala Lys 
450 455 



<210> 171 
<211> 2823 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: GFP-HSP70 

<220> 

<221> CDS 

<222> (1) . . (2823) 

<400> 171 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
X 5 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gee acc tac ggc aag ctg acc ctg aag ttc ate 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 . 



ctg acc tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 ' 80 

cag cac gac ttc ttc aag tec gee atg ccc gaa ggc tac gtc cag gag 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gee gag 
Arg Thr He Phe^Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
XOO 105 110 



gtg cag etc gcc gac cac tac cag cag aac acc ccc ate ggc gac ggc 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 



48 



96 



144 



tgc acc ace ggc aag ctg ccc gtg ccc tgg ccc ace etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



240 



288 



336 



gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 . 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gee gac aag cag aag aac 4 80 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 52 8 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 



576 



ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gcc ctg 624 
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Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 2X5 220 

gtg acc gcc gcc ggg ate act etc gge atg gac gag ctg tac aag tec 
Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

gga atg teg gtg gtg gge ata gac ctg ggc tte eag age tgc tac gtc 
Gly Met Ser Val Val Gly He Asp Leu Gly Phe Gin Ser Cys Tyr Val 
245 250 255 

get gtg gee cgc gcc ggc ggc ate gag act ate get aat gag tat age 
Ala Val Ala Arg Ala Gly Gly He Glu Thr lie Ala Asn Glu Tyr Ser 
260 265 270 

gac cgc tgc aeg ccg get tgc att tet ttt ggt cet aag aat egt tea 
Asp Arg Cys Thr Pro Ala Cys He Ser Phe Gly Pro Lys Asn Arg Ser 
275 280 285 

att gga gca gea get aaa age eag gta att tet aat gca aag aac aea 
He Gly Ala Ala Ala Lys Ser Gin Val He Ser Asn Ala Lys Asn Thr 
290 295 300 

gtc caa gga ttt aaa aga ttc eat gge cga gca ttc tet gat eca ttt 
Val Gin Gly Phe Lys Arg Phe His Gly Arg Ala Phe Ser Asp Pro Phe 
305 310 315 320 

gtg gag gca gaa aaa tet aac ett gca tat gat att gtg eag tgg cet 
Val Glu Ala Glu Lys Ser Asn Leu Ala Tyr Asp He Val Gin Trp Pro 
325 / 330 335 

aea gga tta aea ggt ata aag gtg aea tat atg gag gaa gag cga aat 
Thr Gly Leu Thr Gly He Lys Val Thr Tyr Met Glu Glu Glu Arg Asn 
340 345 350 



gtt ect tgt ttc tat act gat gca gaa aga cga tea gtg atg gat gea 
Val Pro Cys Phe Tyr Thr Asp Ala Glu Arg Arg Ser Val Met Asp Ala 
385 390 395 400 

aea eag att get ggt ett aat tgc ttg cga tta atg aat gaa acc act 
Thr Gin He Ala Gly Leu Asn Cys Leu Arg Leu Met Asn Glu Thr Thr 
405 410 415 

gea gtt get ett gca tat gga ate tat aag eag gat ett ect cgc tta 
Ala Val ^Ala Leu Ala Tyr Gly He Tyr Lys Gin Asp Leu Pro Arg Leu 



135 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



ttt ace act gag caa gtg act gee atg ett ttg tee aaa ctg aag gag 1104 
Phe Thr Thr Glu Gin Val Thr Ala Met Leu Leu Ser Lys Leu Lys Glu 
355 360 365 

aea gcc gaa agt gtt ett aag aag ect gta gtt gac tgt gtt gtt teg 1152 
Thr Ala Glu Ser Val Leu Lys Lys Pro Val Val Asp Cys Val Val Ser 
370 375 380 
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420 425 430 

gaa gag aaa cca aga aat gta gtt ttt gta gac atg ggc cac tct get 1344 
Glu Glu Lys Pro Arg Asn Val Val Phe Val Asp Met Gly His Ser Ala 
435 440 445 

tat caa gtt tct gta tgt gca ttt aat aga gga aaa ctg aaa gtt ctg 13 92 
Tyr Gin Val Ser Val Cys Ala Phe Asn Arg Gly Lys Leu Lys Val Leu 
450 455 460 

gcc act gca ttt gac acg aca ttg gga ggt aga aaa ttt gat gaa gtg 1440 
Ala Thr Ala Phe Asp Thr Thr Leu Gly Gly Arg Lys Phe Asp Glu Val 
465 470 475 4B0 

tta gta aat cac ttc tgt gaa gaa ttt ggg aag aaa tac aag eta gac 14 88 
Leu Val Asn His Phe Cys Glu Glu Phe Gly Lys Lys Tyr Lys Leu Asp 
485 490 495 

att aag tec aaa ate cgt gca tta tta cga etc tct eag gag tgt gag 1536 
lie Lys Ser Lys lie Arg Ala Leu Leu Arg Leu Ser Gin Glu Cys Glu 
500 505 510 

aaa etc aag aaa ttg atg agt gca aat get tea gat etc cct ttg age 1584 
Lys Leu Lys Lys Leu Met Ser Ala Asn Ala Ser Asp Leu Pro Leu Ser 
515 520 525 

att gaa tgt ttt atg aat gat gtt gat gta tct gga act atg aat aga 1632 
lie Glu Cys Phe Met Asn Asp Val Asp Val Ser Gly Thr Met Asn Arg 
530 535 540 

ggc aaa ttt ctg gag atg tgc aat gat etc tta get aga gtg gag cca 1680 
Gly Lys Phe Leu Glu Met Cys Asn Asp Leu Leu Ala Arg Val Glu Pro 
545 550 555 560 

cca ctt cgt agt gtt ttg gaa caa acc aag tta aag aaa gaa gat att 1728 
Pro Leu Arg Ser Val Leu Glu Gin Thr Lys ieu Lys Lys Glu Asp lie 
565 570 575 

tat gca gtg gag ata gtt ggt ggt get aca cga ate cct gcg gta aaa 1776 
Tyr Ala Val Glu He Val Gly Gly Ala Thr Arg He Pro Ala Val Lys 
580 585 590 

gag aag ate age aaa ttt ttc ggt aaa gaa ctt agt aca aca tta aat 1824 
Glu Lys He Ser Lys Phe Phe Gly Lys Glu Leu Ser Thr Thr Leu Asn 
595 600 605 

get gat gaa get gtc act cga ggc tgt gca ttg eag tgt gcc ate tta 1872 
Ala Asp Glu Ala Val Thr Arg Gly Cys Ala Leu Gin Cys Ala He Leu 
610 615 620 

teg cct get ttc aaa gtc aga gaa ttt tct ate act gat gta gta cca 1920 
Ser Pro Ala Phe Lys Val Arg Glu Phe Ser He Thr Asp Val Val Pro 
625 630 635 640 

tat cca ata tct ctg aga tgg aat tct cca get gaa gaa ggg tea agt 1968 
Tyr Pro He Ser Leu Arg Trp Asn Ser Pro Ala Glu Glu Gly Ser Ser 
645 650 655 



136 



wo 00/50872 



PCT/USOO/04794 



gac tgt gaa gtc ttt tec aaa aat cat get get cct ttc tct aaa gtt 
Asp Cys Qlu Val Phe Ser Lys Asn His Ala Ala Pro Phe Ser Lys Val 
660 665 670 

ctt aca ttt tat aga aag gaa cct ttc act ctt gag gcc tac tac age 
Leu Thr Phe Tyr Arg Lys Glu Pro Phe Thr Leu Glu Ala Tyr Tyr Ser 
675 680 685 



gtt cag aaa gtc act cet cag tct gat ggc tec agt tea aaa gtg aaa 
val Gin Lys Val Thr Pro Gin Ser Asp Gly Ser Ser Ser Lys Val Lys 



705 710 



715 720 



gtc aaa gtt cga gta aat gtc cat ggc att ttc agt gtg tec agt gca 
Val LVS Val Arg Val Asn Val His Gly lie Phe Ser Val Ser Ser Ala 
725 730 735 

tct tta gtg gag gtt cac aag tct gag gaa aat gag gag cca atg gaa 
Ser Leu Val Glu Val His Lys Ser Glu Glu Asn Glu Glu Pro Met Glu 
740 745 750 

aca gat cag aat gca aag gag gaa gag aag atg caa gtg gae cag gag 
Thr Asp Gin Asn Ala Lys Glu Glu Glu Lys Met Gin Val Asp Gin Glu 
755 760 765 

gaa cca cat gtt gaa gag caa cag cag cag aca cca gca gaa aat aag 
Glu Pro His Val Glu Glu Gin Gin Gin Gin Thr Pro Ala Glu Asn Lys 
770 775 780 

gca gag tct gaa gaa atg gag acc ;tct caa get gga tec aag gat aaa 
Ala Glu Ser Glu qlu Met Glu Thr Ser Gin Ala Gly Ser Lys Asp Lys 
785 790 .795 800 

aag atg gac caa cca cec caa tgc caa gaa ggc aaa agt gaa gac cag 
Lys Met Asp Gin Pro Pro Gin Cys Gin Glu Gly Lys Ser Glu Asp Gin 
805 810 815 

tac tgt gga cct gcc aat cga gaa tea get ata tgg cag ata gae aga 
Tvr CVS Gly Pro Ala Asn Arg Glu Ser Ala lie Trp Gin He Asp Arg 
820 825 830 

gag atg etc aac ttg tac att gaa aat gag ggt aag atg ate atg cag 
Glu Met Leu Asn Leu Tyr He Glu Asn Glu Gly Lys Met He Met Gin 
835 840 845 

gat aaa ctg gag aag gag egg aat gat get aag aac gca gtg gag gaa 
Asp Lys Leu Glu Lys Glu Arg Asn Asp Ala Lys Asn Ala Val Glu Glu 
850 855 860 

tat gtg tat gaa atg aga gac aag ctt agt ggt gaa tat gag. aag ttt 
Tyr Val Tyr Glu Met Arg Asp Lys Leu Ser Gly Glu Tyr Glu Lys Phe 
865 870 875 880 



137 



2016 



2064 



tct cet cag gat ttg cec tat cca gat cct get ata get cag ttt tea 2112 
Ser Pro Gin Asp Leu Pro Tyr Pro Asp Pro Ala He Ala Gin Phe Ser 
690 695 700 



2160 



2208 



2256 



2304 



2352 



2400 



2448 



2496 



2544 



2592 



2640 
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gtg agt gaa gat gat cgt aac agt ttt act ttg aaa ctg gaa gat act 2 68B 

Val Ser Glu Asp Asp Arg Asn Ser Phe Thr Leu Lys Leu Glu Asp Thr 
885 890 695 

gaa aat tgg ttg tat gag gat gga gaa gac cag oca aag caa gtt tat 2736 

Glu Asn Trp Leu Tyx Glu Asp Gly Glu Asp Gin Pro Lys Gin Val Tyr 
900 905 910 

gtt gat aag ttg get gaa tta aaa aat eta ggt caa cct att aag ata 2 7 84 

Val Asp Lys Leu Ala Glu Leu Lys Asn Leu Gly Gin Pro lie Lys lie 

915 920 925 



cgt ttc cag gaa tct gaa gaa cga cca aat tat ttg aag 
Arg Phe Gin Glu Ser Glu Glu Arg Pro Asn Tyr Leu Lys 
930 935 940 



<210> 172 
<211> 941 
<2X2> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-HSP7 0 
<400> 172 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 / 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp .Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
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165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Met Ser Val Val Gly He Asp Leu Gly Phe Gin Ser Cys Tyr Val 
245 250 255 

Ala Val Ala Arg Ala Gly Gly He Glu Thr He Ala Asn Glu Tyr Ser 
260 265 270 

Asp Arg Cys Thr Pro Ala Cys He Ser Phe Gly Pro Lys Asn Arg Ser 
275 280 285 

He Gly Ala Ala Ala Lys Ser Gin Val He Ser Asn Ala Lys Asn Thr 
290 295 300 

Val Gin Gly Phe Lys Arg Phe His Gly Arg Ala Phe Ser Asp Pro Phe 
305 310 315 320 

Val Glu Ala Glu Lys Ser Asn Leu Ala Tyr Asp He Val Gin Trp Pro 
325 330 335 

Thr Gly Leu Thr Gly He Lys Val Thr Tyr Met Glu Glu Glu Arg Asn 
340 345 ^ 350 

Phe Thr Thr Glu Gin Val Thr Ala Met Leu .Leu Ser Lys Leu Lys Glu 
355 360 365 

Thr Ala Glu Ser Val Leu Lys Lys Pro Val Val Asp Cys Val Val Ser 
370 375 380 

Val Pro Cys Phe Tyr Thr Asp Ala Glu Arg Arg Ser Val Met Asp Ala 
3B5 390 395 400 

Thr Gin He Ala Gly Leu Asn Cys Leu Arg Leu Met Asn Glu Thr Thr 
405 410 415 

Ala Val Ala Leu Ala Tyr Gly He Tyr Lys Gin Asp Leu Pro Arg Leu 
420 . 425 430 

Glu Glu Lys Pro Arg Asn Val Val Phe Val Asp Met Gly His Ser Ala 
435 440 445 

Tyr Gin Val Ser Val Cys Ala Phe Asn Arg Gly Lys Leu Lys Val Leu 
450 455 460 

Ala Thr Ala Phe Asp Thr Thr Leu Gly Gly Arg Lys Phe Asp Glu Val 
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465 470 475 4B0 

Leu Val Asn His Phe Cys Glu Glu Phe Gly Lys Lys Tyr Lys Leu Asp 
485 490 495 

lie Lys Ser Lys lie Arg Ala Leu Leu Arg Leu Ser Gin Glu Cys Glu 
500 505 510 

Lys Leu Lys Lys Leu Met Ser Ala Asn Ala Ser Asp Leu Pro Leu Ser 
515 520 525 

lie Glu Cys Phe Met Asn Asp Val Asp Val Ser Gly Thr Met Asn Arg 
530 535 540 

Gly Lys Phe Leu Glu Met Cys Asn Asp Leu Leu Ala Arg Val Glu Pro 
545 550 555 560 

Pro Leu Arg Ser Val Leu Glu Gin Thr Lys Leu Lys Lys Glu Asp lie 
565 570 575 

Tyr Ala Val Glu lie Val Gly Gly Ala Thr Arg lie Pro Ala Val Lys 
580 585 590 

Glu Lys lie Ser Lys Phe Phe Gly Lys Glu Leu Ser Thr Thr Leu Asn 
595 600 605 

Ala Asp Glu Ala Val Thr Arg Gly Cys- Ala Leu Gin Cys Ala lie Leu 
610 615 620 

Ser Pro Ala Phe Lys Val Arg Glu Phe Ser lie Thr Asp Val Val Pro 
625 630 635 640 

Tyr Pro lie Ser Leu Arg Trp Asn Ser Pro Ala Glu Glu Gly Ser Ser 
645 650 / 655 

Asp Cys Glu Val Phe Ser Lys Asn His Ala Ala Pro Phe Ser Lys Val 
660 665 670 

Leu Thr Phe Tyr Arg Lys Glu Pro Phe Thr Leu Glu Ala Tyr Tyr Ser 
675 680 685 

Ser Pro Gin Asp Leu Pro Tyr Pro Asp Pro Ala lie Ala Gin Phe Ser 
690 695 700 

Val Gin Lys Val Thr Pro Gin Ser Asp Gly Ser Ser Ser Lys Val Lys 
705 710 715 720 

Val Lys Val Arg Val Asn Val His Gly He Phe Ser Val Ser Ser Ala 
725 730 735 

Ser Leu Val Glu Val His Lys Ser Glu Glu Asn Glu Glu Pro Met Glu 
740 745 750 

Thr Asp Gin Asn Ala Lys Glu Glu Glu Lys Met Gin Val Asp Gin Glu 
755 760 765 

Glu Pro His Val Glu Glu Gin Gin Gin Gin Thr Pro Ala Glu Asn Lys 
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770 775 780 

Ala Glu Ser Glu Glu Met Glu Thr Ser Gin Ala Gly Ser Lys Asp Lys 
785 790 795 800 

Lys Met Asp Gin Pro Pro Gin Cys Gin Glu Gly Lys Ser Glu Asp Gin 
805 810 815 

Tyr Cys Gly Pro Ala Asn Arg Glu Ser Ala lie Trp Gin He Asp Arg 
820 825 830 

Glu Met Leu Asn Leu Tyr He Glu Asn Glu Gly Lys Met He Met Gin 
835 840 845 

Asp Lys Leu Glu Lys Glu Arg Asn TVsp Ala Lys Asn Ala Val Glu Glu 
850 855 860 

Tyr Val Tyr Glu Met Arg Asp Lys Leu Ser Gly Glu Tyr Glu Lys Phe 
865 870 875 880 

Val Ser Glu Asp Asp Arg Asn Ser Phe Thr Leu Lys Leu Glu Asp Thr 
885 890 895 

Glu Asn Trp Leu Tyr Glu Asp Gly Glu Asp Gin Pro Lys Gin Val Tyr 
900 905 910 

Val Asp Lys Leu Ala Glu Leu Lys Asn Leu Gly Gin Pro He Lys He 
915 920 ' 925 

Arg Phe Gin Glu Ser Glu Glu Arg Pro Asn Tyr Leu Lys 
930 935 940 



<210> 173 / 

<211> 2674 

<212> DNA 

<213> Artificial Secjuence 
<220> 

<223> Description of Artificial Sequence: GFP-HSC70 

<220> 

<221> CDS 

<222> (1) . . (2673) 

<400> 173 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly.. His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gee acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 
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tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg acc tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 2 40 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 2 88 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Qlu 
85 ' 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 3 36 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gcc gac aag cag aag aac 4 80 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 52 8 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gcc gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gcc ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg ace gcc gee ggg ate act etc ggc atg gac gag ctg tac aag tec 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

gga etc aga tct atg tec aag gga ect gea gtt ggt att gat ctt ggc 768 
Gly Leu Arg Ser Met Ser Lys Gly Pro Ala Val Gly He Asp Leu Gly 
245 250 255 

ace acc tac tct tgt gtg ggt gtt ttc cag cac gga aaa gtc gag ata 816 
Thr Thr Tyr Ser Cys Val Gly Val Phe Gin His Gly Lys Val Glu He 
260 265 270 
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att 
He 


gcc 
Ala 


aat 
Asn 
275 


gat 
Asp 


cag 
Gin 


gga 
Gly 


aac 
Asn 


cga 
Arg 
280 


acc 
Thr 


act 
Thr 


cea 
Pro 


age 
Ser 


tat 
Tyr 
285 


gtc 
Val 


gcc 
Ala 


ttt 
Phe 


864 


acg 
Thr 


gac 
Asp 
290 


act 
Thr 


gaa 
Glu 


egg 

Arg 


ttg 
Leu 


ate 
He 
295 


ggt 
Gly 


gat 
Asp 


gcc 
Ala 


gca 
Ala 


aag 
Lye 
300 


aat 
Asn 


caa 
Gin 


gtt 
Val 


gca 
Ala 


912 


atg 
Met 
305 


aac 
Asn 


ccc 
Pro 


acc 
Thr 


aac 
Asn 


aca 
Thr 
310 


gtt 

Val 


ttt 
Phe 


gat 
Asp 


gcc 
Ala 


aaa 
Lys 
315 


cgt 
Arg 


etg 
Leu 


att 
He 


gga cge 
Gly Arg 
320 


960 


aga 
Arg 


ttt 
Phe 


gat 
Asp 


gat 
Asp 


get 
Ala 
325 


gtt 
Val 


gtc 
Val 


cag 
Gin 


tct 
Ser 


gat 
Asp 
330 


atg 
Met 


aaa 
Lys 


cat 
His 


tgg 
Trp 


ccc 
Pro 
335 


ttt 
Phe 


1008 


atg 
Met 


gtg 
Val 


gtg 
Val 


aat 
Asn 
340 


gat 
Asp 


get 
Ala 


ggc 
Gly 


agg 
Arg 


ccc 
Pro 
345 


aag 
Lys 


gtc 
Val 


caa 
Gin 


gta 
Val 


gaa 
Glu 
350 


tac 
Tyr 


aag 
Lys 


1056 


gga 
Gly 


gag 
Glu 


acc 
Thr 
355 


aaa 
Lys 


age 
Ser 


ttc 
Phe 


tat 
Tyr 


cea 
Pro 
360 


gag 
Glu 


gag 
Glu 


gtg 
Val 


tct 
Ser 


tct 
Ser 
365 


atg 
Met 


gtt 
Val 


etg 
Leu 


1104 


aca 
Thr 


aag 
Lys 
370 


atg 
Met 


aag 
Lys 


gaa 
Glu 


att 
He 


gca 
Ala 
375 


gaa 
Glu 


gee 
Ala 


tac 
Tyr 


ett 
Leu 


ggg 

Gly 
380 


aag 
Lys 


act 
Thr 


gtt 
Val 


acc 
Thr 


1152 


aat 
Asn 
385 


get 
Ala 


gtg 
Val 


gtc 
Val 


aca 
Thr 


gtg 
Val 
390 


cea 
Pro 


get 
Ala 


tac 
Tyr 


ttt 
Phe 


aat 
Asn 
395 


gac 
Asp 


tct 
Ser 


cag 
Gin 


cgt 
Arg 


cag 
Gin 
400 


1200 


get 
Ala 


acc 
Thr 


aaa 
Lys 


gat 
Asp 


get 
Ala 
405 


gga 
Gly 


act 
Thr 


att 
He 


get 
Ala 


ggt 
Gly 
410 


etc 
Leu 


aat 
Asn 


gta 
Val 


ett 
Leu 


aga 
Arg 
415 


atm 
He 


1248 


att 
He 


aat 
Asn 


gag 
Glu 


cea 
Pro 
420 


act 
Thr 


get 
Ala 


get 
Ala 


get 
Ala 


att 
He 
425 


get 
Ala 


tac 
Tyr 


ggc 
Gly 


tta 
Leu 


gac 
Asp 
430 


aaa 
Lys 


aag 
Lys 


1296 


gtt 
Val 


gga 
Gly 


gca 
Ala 
435 


gaa 
Glu 


aga 
Arg 


aac 
Asn 


gtg 
Val 


etc 
Leu 
440 


ate 
He 


ttt 
Phe 


gac 
Asp 


etg 
Leu 


gga 

Gly 
445 


ggt 

Gly 


ggc 
Gly 


act 
Thr 


1344 


ttt 
Phe 


gat 
Asp 
450 


gtg 
Val 


tea 
Ser 


ate 
He 


etc 
Leu 


act 
Thr 
455 


att 
He 


gag 
Glu 


gat 
Asp 


gga 

Gly 


ate 
He 
460 


ttt 
Phe 


gag 

Glu 


gtc 
Val 


aag 
Lys 


1392 


tct 

Q £5 T~ 

465 


aca 


get 

Ala 


gga 

vj J- y 


gac 

Asp 


acc 
Thr 
470 


cac 
His 


ttg 
Leu 


ggt 
Glv 


gga 
Glv 


gaa 
Glu 
475 


gat 
Asp 


ttt 
Phe 


gac 
Asp 


aac 
Asn 


cga 
Arg 
480 


1440 


atg 
Met 


gtc 
Val 


aac 
Asn 


cat 
His 


ttt 

Phe 
485 


att 
He 


get 
Ala 


gag 
Glu 


ttt 

Phe 


aag 
Lys 
490 


cge 
Arg 


aag 
Lys 


cat 
His 


aag 
Lys 


aag 
Lys 
495 


gac 
Asp 


1488 


ate 


agt 


gag 


aac 


aag 


aga 


get 


gta 


aga 


cge 


etc 


cgt 


act 


get 


tgt 


gaa 


1536 
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He Ser Glu Asn Lys Arg Ala Val Arg Arg Leu Arg Thr Ala Cys Glu 
500 505 510 

cgt get aag cgt acc etc tct tec age acc cag gcc agt att gag ate 1584 
Arg Ala Lys Arg Thr Leu Ser Ser Ser Thr Gin Ala Ser He Glu He 
515 520 525 

gat tct etc tat gaa gga ate gac ttc tat acc tec att acc cgt gcc 1632 
Asp Ser Leu Tyr Glu Gly He Asp Phe Tyr Thr Ser He Thr Arg Ala 
530 535 540 

cga ttt gaa gaa ctg aat get gac ctg ttc cgt ggc acc ctg gac cca 1680 
Arg Phe Glu Glu Leu Asn Ala Asp Leu Phe Arg Gly Thr Leu Asp Pro 
545 550 555 560 

gta gag aaa gcc ctt cga gat gcc aaa eta gac aag tea cag att cat 
Val Glu Lys Ala Leu Arg Asp Ala Lys Leu Asp Lys Ser Gin He His 
565 570 575 

gat att gtc ctg gtt ggt ggt tct act cgt ate ccc aag att cag aag 
Asp He Val Leu Val Gly Gly Ser Thr Arg He Pro Lys He Gin Lys 
5B0 585 590 

ctt etc caa gac ttc ttc aat gga aaa gaa ctg aat aag age ate aac 
Leu Leu Gin Asp Phe Phe Asn Gly Lys Glu Leu Asn Lys Ser He Asn 
595 600 605 

cct gat gaa get gtt get tat ggt gca get gtc cag gca gcc ate ttg 1872 
Pro Asp Glu Ala Val Ala Tyr Gly Ala Ala Val Gin Ala Ala He Leu 
610 615 620 



tct gga gac aag tct gag aat gtt caa gat ttg ctg etc ttg gat gtc 
Ser Gly Asp Lys Ser Glu Asn Val Gin Asp Leu Leu Leu Leu Asp Val 
625 630 635 640 



1728 



1776 



1824 



1920 



act cct ctt tec ctt ggt att gaa act get .ggt gga gtc atg act gtc 1968 
Thr Pro Leu Ser Leu Gly He Glu Thr Ala Gly Gly Val Met Thr Val 
645 650 655 

etc ate aag cgt aat acc acc att cct acc aag cag aca cag acc ttc 2016 
Leu He Lys Arg Asn Thr Thr He Pro Thr Lys Gin Thr Gin Thr Phe 
660 665 670 

act acc tat tct gac aac cag cct ggt gtg ctt att cag gtt tat gaa 2064 
Thr Thr Tyr Ser Asp Asn Gin Pro Gly Val Leu He Gin Val Tyr Glu 
675 680 685 

ggc gag cgt gee atg aca aag gat aac aac ctg ctt ggc aag ttt gaa 2112 
Gly Glu Arg Ala Met Thr Lys Asp Asn Asn Leu Leu Gly Lys Phe Glu 
690 695 700 

etc aca ggc ata cct cct gca ccc cga ggt gtt cct cag att gaa gtc 2160 
Leu Thr Gly He Pro Pro Ala Pro Arg Gly Val Pro Gin He Glu Val 
705 710 715 720 

act ttt gac att gat gcc aat ggt ata etc aat gtc tct get gtg gae 2208 
Thr Phe Asp He Asp Ala Asn Gly He Leu Asn Val Ser Ala Val Asp 
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725 730 735 



aag 
Lys 


agt 
Ser 


acg gga 
Thr Gly 
740 


aaa 
Lys 


gag 
Glu 


aac 
Asn 


aag 
Lys 


att 
He 
745 


act 
Thr 


ate 
He 


act 
Thr 


aat 
Asn 


gac 
Asp 
750 


aag 
Lys 


ggc 
Gly 


2256 


cgt 
Arg 


ttg 
Leu 


age 
Ser 
755 


aag 
Lys 


gaa 
Glu 


gac 
Asp 


att 
He 


gaa cgt 
Glu Arg 
760 


atg 
Met 


gtc 
Val 


cag 
Gin 


gaa 
Glu 
765 


get 
Ala 


gag 

Glu 


aag 
Lys 


2304 


tac 


aaa 
Lys 
770 


get 
Ala 


gaa 
Glu 


gat 

ASD 


gag 
Glu 


aag 
Lys 
775 


cag agg 
Gin Arg 


gac 
Asp 


aag 
Lys 


gtg 
Val 
780 


tea 
Ser 


tec 
Ser 


aag 
Lys 


aat 
Asn 


2352 


tea 
Seir 
785 


ctt 
Leu 


gag 
Glu 


tee 
Ser 


tat 
Tvr 


gee 
Ala 
790 


ttc 
Phe 


aac 
Asn 


atg 
Met 


aaa 
Lys 


gca 
Ala 
795 


act 
Thr 


gtt 
Val 


gaa 
Glu 


gat 
Asp 


gag 
Glu 
800 


2400 


aaa 
Lys 


ctt 
Leu 


caa ggc 
Gin Gly 


aag 
Lvs 
805 


att 
He 


aac gat 
Asn Asp 


gag 
Glu 


gac 
Asp 
810 


aaa 
Lys 


cag 
Gin 


aag 
Lys 


att 

He 


etg 
Leu 
815 


gae 
Asp 


2448 


aag 


tgt 

Cys 


aat 
Asn 


gaa 
Glu 
820 


att 
He 


ate 
He 


aac 
Asn 


tgg 
Trp 


ctt 
Leu 
825 


gat 

Asp 


aag 
Lys 


aat 
Asn 


cag 
Gin 


act 
Thr 
830 


get 
Ala 


gag 
Glu 


2496 


aag 

Lvs 


gaa 
Glu 


gaa 
Glu 
835 


ttt 
Phe 


gaa 
Glu 


cat 
His 


caa 
Gin 


cag 
Gin 
840 


aaa 
Lys 


gag 
Glu 


etg 
Leu 


gag 
Glu 


aaa 
Lys 
845 


gtt 
Val 


tge 
Cys 


aac 
Asn 


2544 


ccc 
Pro 


ate 
He 
850 


ate 
He 


ace 
Thr 


aag 
Lys 


etg 
Leu 


tac 
Tyr 
855 


cag 
Gin 


agt 
Ser 


gea 
Ala 


acra 
Gly 


CIQC 

Gly 
860 


atQ 
Met 


cca gga gga 
Pro Gly Gly 


2592 


atg 
Met 
865 


cct 
Pro 


ggg gga 
Gly Gly 


ttt 
Phe 


cct 
Pro 
870 


ggt ggt gga 
Gly Gly Gly 


get 
Ala 


cct 
•Pro 
875 


CCC 

Pro 


tct 
Ser 


ggt ggt get 
Gly Gly Ala 
880 


2640 


tec 
Ser 


tea 
Ser 


ggg 

Gly 


ccc 
Pro 


aec 
Thr 


att 
He 


gaa 
Glu 


gag 
Glu 


gtt 
Val 


gat 
Asp 


taa 


g 










2674 



885 890 



<210> 174 
<211> 890 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-HSC70 
<400> 174 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 
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Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys I^eu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 SO 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

- Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 - 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Met Ser Lys Gly Pro Ala Val Gly He Asp Leu Gly 
245 250 255 

Thr Thr Tyr Ser Cys Val Gly Val Phe Gin His Gly Lys Val Glu He 
260 265 270 

He Ala Asn Asp Gin Gly Asn Arg Thr Thr Pro Ser Tyr Val Ala Phe 
275 280 285 

Thr Asp Thr Glu Arg Leu He Gly Asp Ala Ala Lys Asn Gin Val Ala 
290 295 300 

Met Asn Pro Thr Asn Thr Val Phe Asp Ala Lys Arg Leu He Gly Arg 
305 310 315 320 

Arg Phe Asp Asp Ala Val Val Gin Ser Asp Met Lys His Trp Pro Phe 
325 330 335 
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Met Val Val Asn Asp Ala Gly Arg Pro Lys Val Gin Val Glu Tyr Lys 
340 345 350 

Gly Glu Thr Lys Ser Phe Tyr Pro Glu Glu Val Ser Ser Met Val Leu 
355 360 365 

Thr Lys Met Lye Glu lie Ala Glu Ala Tyr Leu Gly Lys Thr Val Thr 
370 375 380 

Asn Ala Val Val Thr Val Pro Ala Tyr Phe Asn Asp Ser Gin Arg Gin 
385 390 395 400 

Ala Thr Lys Asp Ala Gly Thr He Ala Gly Leu Asn Val Leu Arg He 
405 410 415 

He Asn Glu Pro Thr Ala Ala Ala He Ala Tyr Gly Leu Asp Lys Lys 
420 425 430 

Val Gly Ala Glu Arg Asn Val Leu He Phe Asp Leu Gly Gly Gly Thr 
435 440 445 

Phe Asp Val Ser He Leu Thr He Glu Asp Gly He Phe Glu Val Lys 
450 455 460 

Ser Thr Ala Gly Asp Thr His Leu Gly Gly Glu Asp Phe Asp Asn Arg 
465 ■ 470 . 475 480 

Met Val Asn His Phe He Ala Glu Phe Lys Arg Lys His Lys Lys Asp 
485 490 495 

He Ser Glu Asn Lys Arg Ala Val Arg Arg Leu Arg Thr Ala Cys Glu 
500 505 510 

Arg Ala Lys Arg Thr Leu Ser Ser Ser Thr Gin Ala Ser He Glu He 
515 520 525 

Asp Ser Leu Tyr Glu Gly He Asp Phe Tyr Thr Ser He Thr Arg Ala 
530 535 540 

Arg Phe Glu Glu Leu Asn Ala Asp Leu Phe Arg Gly Thr Leu Asp Pro 
545 550 555 560 

Val Glu Lys Ala Leu Arg Asp Ala Lys Leu Asp Lys Ser Gin He His 
565 570 575 

Asp He Val Leu Val Gly Gly Ser Thr Arg He Pro Lys He Gin Lys 
580 585 590 

Leu Leu Gin Asp Phe Phe Asn Gly Lys Glu Leu Asn Lys Ser He Asn 
595 600 605 

Pro Asp Glu Ala Val Ala Tyr Gly Ala Ala Val Gin Ala Ala He Leu 
610 615 620 

Ser Gly Asp Lys Ser Glu Asn Val Gin Asp Leu Leu Leu Leu Asp Val 
625 630 635 640 
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Thr 



Pro Leu Ser Leu 
645 



Gly 



lie Glu Thr Ala 
650 



Gly 



Gly 



Val 



Met Thr Val 
655 



Leu lie Lys Arg Asn Thr Thr lie Pro Thr Lys Gin Thr Gin Thr Phe 
660 665 670 

Thr Thr Tyr Ser Asp Asn Gin Pro Gly Val Leu He Gin Val Tyx Glu 
675 6B0 685 

Gly Glu Arg Ala Met Thr Lys Asp Asn Asn Leu Leu Gly Lys Phe Glu 
690 695 700 

Leu Thr Gly He Pro Pro Ala Pro Arg Gly Val Pro Gin He Glu Val 
705 710 715 720 

Thr Phe Asp He Asp Ala Asn Gly He Leu Asn Val Ser Ala Val Asp 
725 730 735 

Lys Ser Thr Gly Lys Glu Asn Lys He Thr He Thr Asn Asp Lys Gly 
740 745 750 

Arg Leu Ser Lys Glu Asp He Glu Arg Met Val Gin Glu Ala Glu Lys 
-755 760 765 

Tyr Lys Ala Glu Asp Glu Lys Gin Arg Asp Lys Val Ser Ser Lys Asn 
770 775 780 

Ser Leu Glu Ser Tyr Ala Phe Asn Met Lys Ala Thr Val Glu Asp Glu 
785 790 795 800 

Lys Leu Gin Gly Lys He Asn Asp Glu Asp Lys Gin Lys He Leu Asp 
805 810 815 

Lys Cys Asn Glu He He Asn Trp Leu Asp Lys Asn Gin Thr Ala Glu 
820 825 830 

Lys Glu Glu Phe Glu His Gin Gin Lys Glu Leu Glu Lys Val Cys Asn 
835 840 845 

Pro He He Thr Lys Leu Tyr Gin Ser Ala Gly Gly Met Pro Gly Gly 
850 855 860 

Met Pro Gly Gly Phe Pro Gly Gly Gly Ala Pro Pro Ser Gly Gly Ala 
865 870 875 880 

Ser Ser Gly Pro Thr He Glu Glu Val Asp 



<210> 175 
<211> 2458 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-HSFl 



885 



890 
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<220> 

<221> CDS 

<222> (1) . , (2349) 

<400> 175 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg ace acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg acc tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 24 0 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 2 88 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc ace ate ttc ttc aag gac gac ggc aac tac aag acc cgc gee gag 33 6 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
lis 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gcc gac aag cag aag aac 4 80 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 52 8 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gcc gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gcc ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 
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age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Aen Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 



gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag tec 
Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 



ctg acc aag ctg tgg acc etc gtg age gac ceg gac acc gac gcg etc 
Leu Thr Lys Leu Trp Thr Leu Val Ser Asp Pro Asp Thr Asp Ala Leu 
275 280 285 



720 



gga etc aga tct ega get caa get teg aat tct gca gtc gag atg gat 768 
Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Ala Val Glu Met Asp 
245 250 255 

ctg ccc gtg ggc ccc ggc gcg gcg ggg ccc age aac gtc ceg gee ttc 816 
Leu Pro Val Gly Pro Gly Ala Ala Gly Pro Ser Asn Val Pro Ala Phe 
260 265 270 



864 



ate tgc tgg age ceg age ggg aac age ttc cac gtg ttc gac cag ggc 912 
He Cys Trp Ser Pro Ser Gly Asn Ser Phe His Val Phe Asp Gin Gly 
290 295 300 

cag ttt gcc aag gag gtg ctg ccc aag tac ttc aag cac aac aac atg 960 
Gin Phe Ala Lys Glu Val Leu Pro Lys Tyr Phe Lys His Asn Asn Met 
^05 310 315 320 

gcc age ttc gtg egg eag etc aac atg tat ggc ttc egg aaa gtg gtc 1008 
Ala Ser Phe Val Arg Gin Leu Asn Met Tyr Gly Phe Arg Lys Val Val 
325 330 335 

cac ate gag eag ggc ggc ctg gtc aag cca gag aga gac gac acg gag 1056 
His lie Glu Gin Gly Gly Leu Val Lys Pro Glu Arg Asp Asp Thr Glu 
340 345 350 

ttc cag cac cca tgc ttc ctg cgt ggc cag gag eag etc ctt gag aac 1104 
Phe Gin His Pro Cys Phe Leu Arg Gly Gin Glu Gin Leu Leu Glu Asn 
355 360 365 

ate aag agg aaa gtg acc agt gtg tec acc ctg aag agt gaa gac ata 1152 
He Lys Arg Lys Val Thr Ser Val Ser Thr Leu Lys Ser Glu Asp He 
370 375 380 

aag ate cgc cag gac age gtc ace aag ctg ctg acg gac gtg cag ctg 1200 
Lys lie Arg Gin Asp Ser Val Thr Lys Leu Leu Thr Asp Val Gin Leu 
365 390 395 400 

atg aag ggg aag eag gag tgc atg gac tec aag etc ctg gcc atg aag 1248 
Met Lys Gly Lys Gin Glu Cys Met Asp Ser Lys Leu Leu Ala Met Lys 
405 410 415 

cat gag aat gag get ctg tgg egg gag gtg gee age ctt egg cag aag 1296 
His Glu Asn Glu Ala Leu Trp Arg Glu Val Ala Ser Leu Arg Gin Lys 
420 425 430 
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cat gcc cag caa cag aaa gtc gtc aac aag etc att cag ttc ctg ate 1344 
His Ala Gin Gin Gin Lys Val Val Asn Lys Leu lie Gin Phe Leu lie 
435 440 445 

tea ctg gtg cag tea aac egg ate etg ggg gtg aag aga aag ate eee 13 92 
Ser Leu Val Gin Ser Asn Arg lie Leu Gly Val Lys Arg Lys lie Pro 
450 455 460 

ctg atg ctg aac gac agt ggc tea gea cat tec atg cec aag tat age 144 0 
Leu Met Leu Asn Asp Ser Gly Ser Ala His Ser Met Pro Lys Tyr Ser 
465 470 475 480 

egg cag ttc tec ctg gag cae gtc cac ggc teg ggc ccc tac teg gee 148 8 
Arg Gin Phe Ser Leu Glu His Val His Gly Ser Gly Pro Tyr Ser Ala 
485 490 495 

ccc tec cca gee tac age age tec age etc tac gcc cet gat get gtg 153 6 
Pro Ser Pro Ala Tyr Ser Ser Ser Ser Leu Tyr Ala Pro Asp Ala Val 
500 505 510 

gee age tct gga ccc ate ate tec gac ate ace gag ctg get ect gcc 1584 
Ala Ser Ser Gly Pro lie lie Ser Asp lie Thr Glu Leu Ala Pro Ala 
515 520 525 

age ccc atg gcc tec ccc ggc ggg age ata gac gag agg ccc eta tec 1632 
Ser Pro Met Ala Ser Pro Gly Gly Ser lie Asp Glu Arg Pro Leu Ser 
530 535 540 

age age ccc ctg gtg cgt gtc aag gag gag cec cec age ccg ect cag 1680 
Ser Ser Pro Leu Val Arg Val Lys Glu Glu Pro Pro Ser Pro Pro Gin 
545 550 555 560 

age ccc egg gta gag gag gcg agt ccc ggg cgc cca tct tec gtg gac 1728 
Ser Pro Arg Val Glu Glu Ala Ser Pro Gly Arg Pro Ser Ser Val Asp 
565 570 / 575 

ace etc ttg tec ccg ace gcc etc att gac tec ate ctg egg gag agt 1776 
Thr Leu Leu Ser Pro Thr Ala Leu lie Asp Ser lie Leu Arg Glu Ser 
580 585 590 

gaa ect gcc cec gcc tec gtc aca gee etc acg gac gee agg ggc cae 1824 
Glu Pro Ala Pro Ala Ser Val Thr Ala Leu Thr Asp Ala Arg Gly His 
595 600 605 

acg gac ace gag ggc egg ect ccc tec cec ccg eee ace tec ace cet .1872 
Thr Asp Thr Glu Gly Arg Pro Pro Ser Pro Pro Pro Thr Ser Thr Pro 
610 615 620 

gaa aag tgc etc age gta gcc tgc ctg gac aag aat gag etc agt gac 1920 
Glu Lys Cys Leu Ser Val Ala Cys Leu Asp Lys Asn Glu Leu Ser Asp 
625 630 635 640 

cac ttg gat get atg gac tec aac ctg gat aac ctg cag ace atg etg 1968 
His Leu Asp Ala Met Asp Ser Asn Leu Asp Asn Leu Gin Thr Met Leu 
645 650 655 

age age cac ggc ttc age gtg gac ace agt gee etg ctg gac etg ttc 2016 
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Sex Ser His Gly Phe Ser Val Asp Thr Ser Ala Leu Leu Asp Leu Phe 
660 665 670 

age ccc teg gtg acc gtg ccc gac atg age ctg cct gac ctt gac age 2064 

Ser Pro Ser Val Thr Val Pro Asp Met Ser Leu Pro Asp Leu Asp Ser 
675 680 685 

age ctg gcc agt ate caa gag cte ctg tct ccc cag gag ccc ccc agg 2112 

Ser Leu Ala Ser He Gin Glu Leu Leu Ser Pro Gin Glu Pro Pro Arg 
690 695 700 

cct ccc gag gca gag aac age age ccg gat tea ggg aag cag ctg gtg 2160 

Pro Pro Glu Ala Glu Asn Ser Ser Pro Asp Ser Gly Lys Gin Leu Val 
705 710 715 720 

cac tac aca gcg cag ccg ctg ttc ctg ctg gac ccc ggc tec gtg gac 2208 

His Tyr Thr Ala Gin Pro Leu Phe Leu Leu Asp Pro Gly Ser Val Asp 

725 730 735 

acc ggg age aac gac ctg ccg gtg ctg ttt gag ctg gga gag ggc tec 2256 

Thr Gly Ser Asn Asp Leu Pro Val Leu Phe Glu Leu Gly Glu Gly Ser 
740 745 750 

tac ttc tec gaa ggg gac ggc ttc gcc gag gac ccc acc ate tec ctg 23 04 

Tyr Phe Ser Glu Gly Asp Gly Phe Ala Glu Asp Pro Thr He Ser Leu 
755 -760 765 

ctg aca ggc teg gag cct ccc aaa gcc aag gac ccc act gtc tec 2349 

Leu Thr Gly Ser Glu Pro Pro Lys Ala Lys Asp Pro Thr Val Ser 
770 775 780 

tagaggcece ggaggagetg ggccagccgc ccacccccac ccccagtgca gggctggtct 2409 

tggggaggca gggeagcctc gcggtcttgg gcactggtgg gtcggecgg 2458 



<210> 176 
<211> 783 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sec[uence: GFP-HSFl 
<400> 176 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 
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Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 30 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys^ Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 , 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Ala Val Glu Met Asp 
245 250 255 

Leu Pro Val Gly Pro Gly Ala Ala Gly Pro Ser Asn Val Pro Ala Phe 
260 265 270 

Leu Thr Lys Leu Trp Thr Leu Val Ser Asp Pro Asp Thr Asp Ala Leu 
275 280 285 

He Cys Trp Ser Pro Ser Gly Asn Ser Phe His Val Phe Asp Gin Gly 
290 295 300 

Gin Phe Ala Lys Glu Val Leu Pro Lys Tyr Phe Lys His Aan Asn Met 
305 310 315 320 

Ala Ser Phe Val Arg Gin Leu Asn Met Tyr Gly Phe Arg Lys Val Val 
325 330 335 

His He Glu Gin Gly. Gly Leu Val Lys Pro Glu Arg Asp Asp Thr Glu 
340 345 350 

Phe Gin His Pro Cys Phe Leu Arg Gly Gin Glu Gin Leu Leu Glu Asn 
355 360 365 
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lie Lys Arg Lys Val Thr Ser Val Ser Thr Leu Lys Ser Glu Asp He 
370 375 380 

Lys lie Arg Gin Asp Ser Val Thr Lys Leu Leu Thr Asp Val Gin Leu 
385 390 395 400 

Met Lys Gly Lys Gin Glu Cys Met Asp Ser Lys Leu Leu Ala Met Lys 
405 410 415 

His Glu Asn Glu Ala Leu Trp Arg Glu Val Ala Ser Leu Arg Gin Lys 
420 425 430 

His Ala Gin Gin Gin Lys Val Val Asn Lys Leu He Gin Phe Leu He 
435 440 445 

Ser Leu Val Gin Ser Asn Arg He Leu Gly Val Lys Arg Lys He Pro 
450 455 460 

Leu Met Leu Asn Asp Ser Gly Ser Ala His Ser Met Pro Lys Tyr Ser 
465 470 475 480 

Arg Gin Phe Ser Leu Glu His Val His Gly Ser Gly Pro Tyr Ser Ala 
485 490 495 

Pro Ser Pro Ala Tyr Ser Ser Ser Ser Leu Tyr Ala Pro Asp Ala Val 
500 505 - 510 

Ala Ser Ser Gly Pro He He Ser Asp He Thr Glu Leu Ala Pro Ala 
515 520 525 

Ser Pro Met Ala Ser Pro Gly Gly Ser He Asp Glu Arg Pro Leu Ser 
530 535 * 540 

Ser Ser Pro Leu Val Arg Val Lys Glu Glu Pro Pro Ser Pro Pro Gin 
545 550 555 560 

Ser Pro Arg Val Glu Glu Ala Ser Pro Gly Arg Pro Ser Ser Val Asp 
565 570 575 

Thr Leu Leu Ser Pro Thr Ala Leu He Asp Ser He Leu Arg Glu Ser 
580 585 590 

Glu Pro Ala Pro Ala Ser Val Thr Ala Leu Thr Asp Ala Arg Gly His 
595 600 605 

Thr Asp Thr Glu Gly Arg Pro Pro Ser Pro Pro Pro Thr Ser Thr Pro 
610 615 620 

Glu Lys Cys Leu Ser Val Ala Cys Leu Asp Lys Asn Glu Leu Ser Asp 
625 630 635 640 

His Leu Asp Ala Met Asp Ser Asn Leu Asp Asn Leu Gin Thr Met Leu 
645 650 655 

Ser Ser His Gly Phe Ser Val Asp Thr Ser Ala Leu Leu Asp Leu Phe 
660 665 670 
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Ser Pro Ser Val Thr Val 
675 

Ser Leu Ala Ser lie Gin 
690 

Pro Pro Glu Ala Glu Asn 
705 710 

His Tyr Thx Ala Gin Pro 
725 

Thr Gly Ser Asn Asp Leu 
740 

Tyr Phe Ser Glu Gly Asp 
755 

Leu Thr Gly Ser Glu Pro 
770 



Pro Asp Met Ser Leu 
680 

Glu Leu Leu Ser Pro 
695 

Ser Ser Pro Asp Ser 
715 

Leu Phe Leu Leu Asp 
730 

Pro Val Leu Phe Glu 
745 

Gly Phe Ala Glu Asp 
760 

Pro Lys Ala Lys Asp 
775 



Pro Asp Leu Asp Ser 
685 

Gin Glu Pro Pro Arg 
700 

Gly Lys Gin Leu Val 
720 

Pro Gly Ser Val Asp 
735 

Leu Gly Glu Gly Ser 
750 

Pro Thr lie Ser Leu 
765 

Pro Thr Val Ser 
780 



<210> 177 
<211> 2416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-NFKB 

<220> 

<221> CDS 

<222> (1) . . (2415) 

<400> 177 

atg gtg age aag ggc gag gag ctg ttc acc ^gg gtg gtg ccc ate ctg 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 



gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gee acc tac ggc aag ctg acc ctg aag ttc ate 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 

Cye Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg ace tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 240 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



cag cac gac ttc ttc aag tec gee atg ccc gaa ggc tac gtc cag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
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85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lya Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asm Arg lie Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cae aac gtc tat ate atg gcc gac aag cag aag aac 4 80 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

gtg cag etc gee gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gee ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gee ggg ate act etc ggc atg gac gag ctg tac aag tec 720 
Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

gga etc aga tct ega gat ccg eee ttc atg gac gaa ctg ttc ccc etc 768 
Gly Leu Arg Ser Arg Asp Pro Pro Phe Met Asp Glu Leu Phe Pro Leu 
245 250 255 

ate ttc ccg gea gag cea gcc cag gcc tct ggc ccc tat gtg gag ate 816 
lie Phe Pro Ala Glu Pro Ala Gin Ala Ser Gly Pro Tyr Val Glu lie 
260 265 270 

att gag cag ccc aag cag egg ggc atg cgc ttc cgc tac aag tgc gag 864 
lie Glu Gin Pro Lys Gin Arg Gly Met Arg Phe Arg Tyr Lys Cys Glu 
275 280 285 

ggg cgc tec gcg ggc age ate cea ggc gag agg age aca gat acc acc 912 
Gly Arg Ser Ala Gly Ser lie Pro Gly Glu Arg Ser Thr Asp Thr Thr 
290 295 300 

aag acc cac ccc ace ate aag ate aat ggc tac aca gga cea ggg aca 960 
Lys Thr His Pro Thr lie Lys lie Asn Gly Tyr Thr Gly Pro Gly Thr 
305 310 315 320 
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gtg cgc ate tec ctg gtc acc aag gac cct cct cac egg cct cac ccc 1008 
Val Arg lie Ser Leu Val Thr Lys Asp Pro Pro His Arg Pro His Pro 
325 330 335 

cac gag ctt gta gga aag gac tgc egg gat ggc ttc tat gag get gag 1056 
His Glu Leu Val Gly Lys Asp Cys Arg Asp Gly Phe Tyr Glu Ala Glu 
340 345 350 

etc tgc ecg gac cgc tgc ate cac agt ttc cag aac ctg gga ate cag 1104 

Leu Cys Pro Asp Arg Cys lie His Ser Phe Gin Asn Leu Gly lie Gin 
355 360 365 

tgt gtg aag aag egg gac ctg gag cag get ate agt cag cgc ate cag 1152 

Cys Val Lys Lys Arg Asp Leu Glu Gin Ala lie Ser Gin Arg lie Gin 
370 375 380 

acc aac aac aac ccc ttc caa gtt cct ata gaa gag cag cgt ggg gac 1200 

Thr Asn Asn Asn Pro Phe Gin Val Pro lie Glu Glu Gin Arg Gly Asp 

385 390 395 400 

tac gac ctg aat get gtg egg etc tgc ttc cag gtg aca gtg egg gac 124 8 

Tyr Asp Leu Asn Ala Val Arg Leu Cys Phe Gin Val Thr Val Arg Asp 
405 410 .415 

eca tea ggc agg ccc etc cgc ctg ecg cct gtc ctt tct eat ccc ate 1296 

Pro Ser Gly Arg Pro Leu Arg Leu Pro Pro Val Leu Ser His Pro lie 
420 425 430 

ttt gac aat cgt gee ccc aac act gee gag etc aag ate tgc ega gtg 1344 

Phe Asp Asn Arg Ala Pro Asn Thr Ala Glu Leu Lys lie Cys Arg Val 
435 440 445 



aac cga aac tct ggc age tgc etc ggt ggg gat gag ate ttc eta ctg 
Asn Arg Asn Ser Gly Ser Cys Leu Gly Gly Asp Glu lie Phe Leu Leu 
450 455 460 



1392 



tgt gac aag gtg cag aaa gag gac att gag gtg tat ttc acg gga eca 1440 
Cys Asp Lys Val Gin Lys Glu Asp lie Glu Val Tyr Phe Thr Gly Pro 
465 470 475 480 

ggc tgg gag gee cga ggc tec ttt teg caa get gat gtg cac cga caa 14 88 
Gly Trp Glu Ala Arg Gly Ser Phe Ser Gin Ala Asp Val His Arg Gin 
485 490 495 

gtg gee att gtg ttc egg acc cct ccc tac gca gac ccc age ctg cag 1536 
Val Ala lie Val Phe Arg Thr Pro Pro Tyr Ala Asp Pro Ser Leu Gin 
500 505 510 

get cct gtg cgt gtc tec atg cag ctg egg egg cct tec gac egg gag 1584 
TQa Pro Val Arg Val Ser Met Gin Leu Arg Arg Pro Ser Asp Arg Glu 
515 520 525 

etc agt gag ccc atg gaa ttc cag tac ctg eca gat aca gac gat cgt 1632 
Leu Ser Glu Pro Met Glu Phe Gin Tyr Leu Pro Asp Thr Asp Asp Arg 
530 535 540 
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cac egg att gag gag aaa cgt aaa agg aca tat gag acc ttc aag age 1680 
His Arg lie Glu Glu Lys Arg Lys Arg Thr Tyr Glu Thr Phe Lys Ser 
545 550 555 560 

ate atg aag aag agt cct ttc age gga ccc acc gac ccc egg cct cca 1728 
lie Met Lys Lys Ser Pro Phe Ser Gly Pro Thr Asp Pro Arg Pro Pro 
565 570 575 

cet ega cgc att get gtg cct tee cgc age tea get tet gtc ccc aag 1776 
Pro Arg Arg lie Ala Val Pro Ser Arg Ser Ser Ala Ser Val Pro Lys 
580 585 590 

eea gea ccc eag eee tat ccc ttt acg tea tec ctg age ace ate aac 1824 
Pro Ala Pro Gin Pro Tyr Pro Phe Thr Ser Ser Leu Ser Thr lie Asn 
595 600 605 

tat gat gag ttt ccc acc atg gtg ttt cct tet ggg eag ate age eag 
Tyr Asp Glu Phe Pro Thr Met Val Phe Pro Ser Gly Gin lie Ser Gin 
610 615 620 

gee teg gee ttg gee ecg gee cct ccc caa gtc ctg ccc. eag get cca 
Ala Ser Ala Leu Ala Pro Ala Pro Pro Gin Val Leu Pro Gin Ala Pro 
625 630 635 640 

gee cct gee cct get eea gee atg gta tea get ctg gee cag gee cca 1968 
Ala Pro Ala Pro Ala Pro Ala Met Val Ser Ala Leu Ala Gin Ala Pro 
- 645 650 655 

gee cct gtc cca gtc eta gee cca ggc cct cct cag get gtg gee cca 2016 
Ala Pro Val Pro Val Leu Ala Pro Gly Pro Pro Gin Ala Val Ala Pro 
660 665 670 

cct gee eee aag ccc acc cag get ggg gaa gga acg ctg tea gag gee 2064 
Pro Ala Pro Lys Pro Thr Gin Ala Gly Glu Gly Thr Leu Ser Glu Ala 
675 680 685 

ctg ctg eag ctg eag ttt gat gat gaa gac ctg ggg gee ttg ett ggc 2X12 
Leu Leu Gin Leu Gin Phe Asp Asp Glu Asp Leu Gly Ala Leu Leu Gly 
690 695 700 



aac age aca gac cca get gtg ttc aca gac ctg gea tee gtc gac aac 
Asn Ser Thr Asp Pro Ala Val Phe Thr Asp Leu Ala Ser Val Asp Asn 
705 710 715 720 

tec gag ttt cag cag ctg ctg aac eag ggc ata cct gtg gee ccc cac 
Ser Glu Phe Gin Gin Leu Leu Asn Gin Gly lie Pro Val Ala Pro His 
725 730 735 



gtg aca gee cag agg ccc ccc gac eea get cct get cca ctg ggg gee 

Val Thr Ala Gin Arg Pro Pro Asp Pro Ala Pro Ala Pro Leu Gly Ala 
755 760 765 

ccg ggg etc ccc aat ggc etc ett tea gga gat gaa gac ttc tec tec 



1872 



1920 



2160 



2208 



aca act gag ccc atg ctg atg gag tac cct gag get ata act cgc eta 2256 
Thr Thr Glu Pro Met Leu Met Glu Tyr Pro Glu Ala lie Thr Arg Leu 
740 745 750 



2304 



2352 
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Pro Gly Leu Pro Asn Gly Leu Leu Ser Gly Asp Glu Asp Phe Ser Ser 
770 775 780 

att gcg gac atg gac ttc tea gcc ctg ctg agt cag ate age tec aag 2400 
lie Ala Asp Met Asp Phe Ser Ala Leu Leu Ser Gin lie Ser Ser Lys 
785 790 795 800 

ggc gaa ttc gaa get t 2416 
Gly Glu Phe Glu Ala 
805 



<210> 178 
<211> 805 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-KTFKB 
<400> 178 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 IBS 190 
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Pro Val Leu Leu Pro 
195 

Ser Lys Asp Pro Asn 
210 

Val Thr Ala Ala Gly 
225 

Gly Leu Arg Ser Arg 
245 

lie Phe Pro Ala Glu 
260 

lie Glu Gin Pro Lys 
275 

Gly Arg Ser Ala Gly 
290 

Lys Thr His Pro Thr 
305 

.Val" Arg lie Ser Leu 
325 

His Glu Leu Val Gly 
340 

Leu Cys Pro Asp Arg 
355 

Cys Val Lys Lys Arg 
370 

Thr Asn Asn Asn Pro 
365 

Tyr Asp Leu Asn Ala 
405 

Pro Ser Gly Arg Pro 
420 



Asp Asn His Tyr Leu Ser 
200 

Glu Lys Arg Asp His Met 
215 

lie Thr Leu Gly Met Asp 
230 235 

Asp Pro Pro Phe Met Asp 
250 

Pro Ala Gin Ala Ser Gly 
265 

Gin Arg Gly Met Arg Phe 
280 

Ser lie Pro Gly Glu Arg 
295 

lie Lys lie Asn Gly Tyr 
310 315 

Val Thr Lys Asp Pro Pro 
330 

Lys Asp Cys Arg Asp Gly 
345 

Cys lie His Ser Phe Gin 
360 

Asp Leu Glu Gin Ala Xle 
375 

Phe Gin Val Pro lie Glu 
390 395 

Val Arg Leu Cys Phe Gin 
410 

Leu Arg Leu Pro Pro Val 
425 



Thr Gin Ser Ala Leu 
205 

Val Leu Leu Glu Phe 
220 

Glu Leu Tyr Lys Ser 
240 

Glu Leu Phe Pro Leu 
255 

Pro Tyr Val Glu He 
270 

Arg Tyr Lys Cys Glu 
285 

Ser Thr Asp Thr Thr 
300 

Thr Gly Pro Gly Thr 
320 

His Arg Pro His Pro 
335 

Phe Tyr Glu Ala Glu 
350 

Asn Leu Gly He Gin 
365 

Ser Gin Arg He Gin 
380 

Glu Gin Arg Gly Asp 
400 

Val Thr Val Arg Asp 
415 

Leu Ser His Pro He 
430 



Phe Asp Asn Arg Ala 
435 

Asn Arg Asn Ser Gly 
450 

Cys Asp Lys Val Gin 
465 

Gly Trp Glu Ala Arg 
465 



Pro Asn Thr Ala Glu Leu 
440 

Ser Cys Leu Gly Gly Asp 
455 

Lys Glu Asp He Glu Val 
470 475 

Gly Ser Phe Ser Gin Ala 
490 



Lys He Cys Arg Val 
445 

Glu He Phe Leu Leu 
460 

Tyr Phe Thr Gly Pro 
480 

Asp Val His Arg Gin 
495 
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Val Ala lie Val Phe Arg Thr Pro Pro Tyr Ala Asp Pro Ser Leu Gin 
500 505 510 

Ala Pro Val Arg Val Ser Met Gin Leu Arg Arg Pro Ser Asp Arg Glu 
515 520 525 

Leu Ser Glu Pro Met Glu Phe Gin Tyr Leu Pro Asp Thr Asp Asp Arg 
530 535 540 

His Arg lie Glu Glu Lys Arg Lys Arg Thr Tyr Glu Thr Phe Lys Ser 
545 550 555 560 

lie Met Lys Lys Ser Pro Phe Ser Gly Pro Thr Asp Pro Arg Pro Pro 
565 570 575 

Pro Arg Arg lie Ala Val Pro Ser Arg Ser Ser Ala Ser Val Pro Lys 
580 585 590 

Pro Ala Pro Gin Pro Tyr Pro Phe Thr Ser Ser Leu Ser Thr lie Asn 
595 600 605 

Tyr Asp Glu Phe Pro Thr Met Val Phe Pro Ser Gly Gin lie Ser Gin 
610 615 620 

Ala Ser Ala Leu Ala Pro Ala Pro Pro Gin Val Leu Pro Gin Ala Pro 
625 . 630 635 640 

Ala Pro Ala Pro Ala Pro Ala Met Val Ser Ala Leu Ala Gin Ala Pro 
645 650 655 

Ala Pro Val Pro Val Leu Ala Pro Gly Pro Pro Gin Ala Val Ala Pro 
660 665 670 

Pro Ala Pro Lys Pro Thr Gin Ala Gly Glu Gly Thr Leu Ser Glu Ala 
675 680 685 

Leu Leu Gin Leu Gin Phe Asp Asp Glu Asp Leu Gly Ala Leu Leu Gly 
690 695 700 

Asn Ser Thr Asp Pro Ala Val Phe Thr Asp Leu Ala Ser Val Asp Asn 
705 710 715 720 

Ser Glu Phe Gin Gin Leu Leu Asn Gin Gly He Pro Val Ala Pro His 
725 730 735 

Thr Thr Glu Pro Met Leu Met Glu Tyr Pro Glu Ala He Thr Arg Leu 
740 745 750 

Val Thr Ala Gin Arg Pro Pro Asp Pro Ala Pro Ala Pro Leu Gly Ala 
755 760 765 

Pro Gly Leu Pro Asn Gly Leu Leu Ser Gly Asp Glu Asp Phe Ser Ser 
770 775 780 

He Ala Asp Met Asp Phe Ser Ala Leu Leu Ser Gin He Ser Ser Lys 
785 790 795 800 
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Gly Glu Phe Glu Ala 
805 



<210> 179 
<211> 1677 
<212> DNA 

<213> Artificial Secpience 
<220> 

<223> Description of Artificial Sequence: GFP-IKB 

<220> 

<221> CDS 

<222> (1) . . (1674) 

<400> 179 

atg ttc cag gcg get gag cgc ccc cag gag tgg gcc atg gag ggc ccc 48 . 

Met Phe Gin Ala Ala Glu Arg Pro Gin Glu Trp Ala Met Glu Gly Pro 

1 5 10 15 

cgc gac ggg ctg aag aag gag egg eta ctg gac gac cgc cac gac age 96 
Arg Asp Gly Leu Lys Lys Glu Arg Leu Leu Asp Asp Arg His Asp Ser 
20 25 30 

ggc ctg gac tec atg aaa gac gag gag tac gag cag atg gtc aag gag 144 
Gly Leu Asp Ser Met Lys Asp Glu Glu Tyr Glu Gin Met Val Lys Glu 
35 40 45 

ctg cag gag ate cgc etc gag ccg cag gag gtg ceg cgc ggc teg gag 192 
Leu Gin Glu lie Arg Leu Glu Pro Gin Glu Val Pro Arg Gly Ser Glu 
50 . 55 60 

ccc tgg aag cag cag etc ace ga^ gae ggg gac teg ttc ctg cac ttg 24 0 
Pro Trp Lys Gin Gin Leu Thr Glu Asp Gly Asp Ser Phe Leu His Leu 
65 70 75 80 

gcc ate ate cat gaa gaa aag gca ctg ace atg gaa gtg ate cgc cag 2 88 
Ala lie lie. His Glu Glu Lys Ala Leu Thr Met Glu Val lie Arg Gin 
85 90 95 

gtg aag gga gac ctg gcc ttc etc aac etc cag aac aac ctg cag cag 336 
Val Lys Gly Asp Leu Ala Phe Leu Asn Leu Gin Asn Asn Leu Gin Gin 
100 105 110 

act cca etc cac ttg get gtg ate ace aac cag cca gaa att get gag 384 
Thr Pro Leu His Leu Ala Val lie Thr Asn Gin Pro Glu lie Ala Glu 
115 120 125 

gca ett ctg gga get ggc tgt gat cet gag etc cga gac ttt cga gga 432 
Ala Leu Leu Gly Ala Gly Cys Asp Pro Glu Leu Arg Asp Phe Arg Gly 
130 135 140 

aat acc ccc eta cac ett gcc tgt gag cag ggc tgc ctg gcc age gtg 480 
Asn Thr Pro Leu His Leu Ala Cys Glu Gin Gly Cys Leu Ala Ser Val 
145 150 155 160 
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gga gtc ctg act cag tec tgc acc acc ccg cac etc cac tec ate ttg 
Qly Val Leu Thr Gin Ser Cys Thr Thr Pro His Leu His Ser lie Leu 
165 170 175 



528 



aag get acc aac tac aat ggc cac acg tgt eta cac tta gee tct ate 576 
Lys Ala Thr Asn Tyr Asn Gly His Thr Cys Leu His Leu Ala Ser lie 
180 185 190 

cat ggc tac ctg ggc ate gtg gag ctt ttg gtg tec ttg ggt get gat 624 
His Gly Tyr Leu Gly lie Val Glu Leu Leu Val Ser Leu Gly Ala Asp 
195 200 205 

gtc aat get cag gag cec tgt aat ggc egg act gee ctt cac etc gca 672 
Val Asn Ala Gin Glu Pro Cys Asn Gly Arg Thr Ala Leu His Leu Ala 
210 215 220 

gtg gac ctg caa aat ect gac ctg gtg tea etc ctg ttg aag tgt ggg 
Val Asp Leu Gin Asn Pro Asp Leu Val Ser Leu Leu Leu Lys Cys Gly 
225 230 235 240 

get gat gtc aac aga gtt acc tac cag ggc tat tct cec tac cag etc 
Ala Asp Val Asn Arg Val Thr Tyr Gin Gly Tyr Ser Pro Tyr Gin Leu 
245 250 255 

acc tgg ggc cgc cca age acc egg ata cag cag cag ctg ggc cag ctg 
Thr Trp Gly Arg Pro Ser Thr Arg lie Gin Gin Gin Leu. Gly Gin Leu 
260 265 270 

aca eta gaa aac ctt cag atg ctg cca gag agt gag gat gag gag age 
Thr Leu Glu Asn Leu Gin Met Leu Pro Glu Ser Glu Asp Glu Glu Ser 
275 280 285 

tat gac aca gag tea gag ttc acg gag ttc aca gag gac gag ctg cec 
Tyr Asp Thr Glu Ser Glu Phe Thr Glu Phe Thr Glu Asp Glu Leu Pro 
^ 290 295 / 300 

tat gat gac tgt gtg ttt gga ggc cag cgt ctg acg tta acc ggt atg 
Tyr Asp Asp Cys Val Phe Gly Gly Gin Arg Leu Thr Leu Thr Gly Met 
305 310 315 320 

get age aaa gga gaa gaa etc ttc act gga gtt gtc cca att ctt gtt 
Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
325 330 335 

gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga gag 1056 
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
340 345 350 



ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc ate tgc 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
355 360 365 

act act ggc aaa ctg cct gtt cca tgg cca aca eta gtc act act ctg 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
370 375 380 

tgc tat ggt gtt caa tgc ttt tea aga tac ccg gat cat atg aaa egg 
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Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg 
385 390 395 400 

cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa agg 124 8 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
405 410 415 

acc ate ttc ttc aaa gat gac ggc aac tac aag aca cgt get gaa gtc 1296 
Thr lie Phe Phe Lys Asp. Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
420 425 430 

aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt att 1344 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly lie 
435 440 445 

gac ttc aag gaa gat ggc aac att ctg gga eac aaa ttg gaa tac aac 13 92 
Asp Phe Lys Glu Asp Gly TVsn lie Leu Gly His Lys Leu Glu Tyr Asn 
450 455 460 

tat aac tea cac aat gta tac ate atg gca gac aaa caa aag aat gga 1440 
Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly 
465 470 475 480 

ate aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga age gtt 14 88 
lie Lys Val Asn Phe Lys Thr Arg His Asn lie Glu Asp Gly Ser Val 
485 490 495 

caa eta gca gac cat tat caa caa aat act cca att ggc gat ggc cet 1536 
Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro 
500 505 510 

gtc ctt tta cca gac aac cat tac ctg tec aca caa tct gee ett teg 1584 
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
/ 515 520 525 

aaa gat ccc aac gaa aag aga gac cac atg .gtc ett ctt gag ttt gta 1632 
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
530 535 540 

aca get get ggg att aca cat ggc atg gat gaa ctg tac aac tag 1677 
Thr Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Asn 
545 550 555 



<210> 180 
<211> 558 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-IKB 
<400> 180 

Met Phe Gin Ala Ala Glu Arg Pro Gin Glu Trp Ala Met Glu Gly Pro 
1 5 10 15 

Arg Asp Gly Leu Lys Lys Glu Arg Leu Leu Asp Asp Arg His Asp Ser 
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20 25 30 

Gly Leu Asp Ser Met Lys Asp Glu Glu Tyr Glu Gin Met Val Lys Glu 
35 40 45 

Leu Gin Glu lie Arg Leu Glu Pro Gin Glu Val Pro Arg Gly Ser Glu 
50 55 60 

Pro Trp Lys Gin Gin Leu Thr Glu Asp Gly Asp Ser Phe Leu His Leu 
65 70 75 80 

Ala lie lie His Glu Glu Lys Ala Leu Thr Met Glu Val lie Arg Gin 
85 90 95 

Val Lys Gly Asp Leu Ala Phe Leu Asn Leu Gin Asn Asn Leu Gin Gin 
100 105 110 

Thr Pro Leu His Leu Ala Val lie Thr Asn Gin Pro Glu lie Ala Glu 
115 120 125 

Ala Leu Leu Gly Ala Gly Cys Asp Pro Glu Leu Arg Asp Phe Arg Gly 
130 135 140 

Asn Thr Pro Leu His Leu Ala Cys Glu Gin Gly Cys Leu Ala Ser Val 
145 150 155 160 



Gly Val Leu Thr 



Lys Ala Thr Asn 
IBO 

His Gly Tyr Leu 



Val Asn Ala Gin 
210 

Val Asp Leu Gin 
225 

Ala Asp Val Asn 



Thr Trp Gly Arg 
260 

Thr Leu Glu Asn 
275 



Gin Ser Cys Thr 
165 

Tyr Asn Gly His 



Gly lie Val Glu 
200 

Glu Pro Cys Asn 
215 

Asn Pro Asp Leu 
230 

Arg Val Thr Tyr 
245 

Pro Ser Thr Arg 



Leu Gin Met Leu 
280 



Thr Pro His Leu 
170 

Thr Cys Leu His 
185 

Leu Leu Val Ser 



Gly Arg .Thr Ala 
220 

Val Ser Leu Leu 
235 

Gin Gly Tyr Ser 
250 

lie Gin Gin Gin 
265 

Pro Glu Ser Glu 



His Ser lie Leu 
175 

Leu Ala Ser lie 
190 

Leu Gly Ala Asp 
205 

Leu His Leu Ala 



Leu Lys Cys Gly 
240 

Pro Tyr Gin Leu 
255 

Leu Gly Gin Leu 
270 

Asp Glu. Glu Ser 
285 



Tyr Asp Thr Glu Ser Glu Phe Thr Glu Phe Thr Glu Asp Glu Leu Pro 
290 295 300 

Tyr Asp Asp Cys Val Phe Gly Gly Gin Arg Leu Thr Leu Ttir Gly Met 
305 310 315 320 

Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
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325 

Glu Leu Asp Gly Asp 
340 

Gly Glu Gly Asp Ala 
355 

Thr Thr Gly Lys Leu 
370 

Cys Tyr Gly Val Gin 
3B5 

His Asp Phe Phe Lys 
405 

Thr lie Phe Phe Lys 
420 

Lys Phe Glu Gly Asp 
435 

Asp Phe Lys Glu Asp 
450 

Tyr Asn Ser His Asn 
465 

lie Lys Val Asn Phe 
485 

Gin Leu Ala Asp His 
500 

Val Leu Leu Pro Asp 
515 

Lys Asp Pro Asn Glu 
530 

Thr Ala Ala Gly He 
545 



330 

Val Asn Gly His Lys Phe 
345 

Thr Tyr Gly Lys Leu Thr 
360 

Pro Val Pro Trp Pro Thr 
375 

Cys Phe Ser Arg Tyr Pro 
390 395 

Ser Ala Met Pro Glu Gly 
410 

Asp Asp Gly Asn Tyr Lys 
425 

Thr Leu Val Asn Arg He 
440 

Gly Asn He Leu Gly His 
455 

Val Tyr He Met Ala Asp 
470 475 

Lys Thr Arg His Asn He 
490 

Tyr Gin Gin Asn Thr Pro 
505 

Asn His Tyr Leu Ser Thr 
520 

Lys Arg App His Met Val 
535 

Thr His Gly Met Asp Glu 
550^ 555 



335 

Ser Val Ser Gly Glu 
350 

Leu Lys Phe He Cys 
365 

Leu Val Thr Thr Leu 
380 

Asp His Met Lys Arg 
400 

Tyr Val Gin Glu Arg 
415 

Thr Arg Ala Glu Val 
430 

Glu Leu Lys Gly He 
445 

Lys Leu Glu Tyr Asn 
460 

Lys Gin Lys Asn Gly 
480 

Glu Asp Gly Ser Val 
495 

He Gly Asp Gly Pro 
510 

Gin Ser Ala Leu Ser 
525 

Leu Leu Glu Phe Val 
540 

Leu Tyr Asn 
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A SYSTEM FOR CELL-BASED SCREENING 

5 Cross Reference 

This application claims priority to U.S. Provisional Applications for Patent 
Serial Nos. 60/122,152 (February 26, 1999), 60/123,399 (March 8, 1999), 09/352,141, 
(July 12, 1999), 60/151,797 (August 31, 1999), 60/168,408 (December 1, 1999); and is 
a continuation in part of 09/430,656 (October 29, 1999); 09/398,965 filed September 
10 17, 1999 which is a continuation in part of Serial No. 09/031,271 filed Febmary 27, 
1998 which is a continuation in part of U.S. Application S/N 08/810983, filed on 
February 27, 1997. 

Field of The Invention 

15 

This invention is in the field of fluorescence-based cell and molecular 
biochemical assays for drug discovery. 

Background of the Invention 

20 , / 

Drug discovery, as currently practiced in the art, is a long, multiple step process 
involving identification of specific disease targets, development of an assay based on a 
specific target, validation of the assay, optimization and automation of the assay to 
produce a screen, high throughput screening of compound libraries using the assay to 

25 identify "hits", hit validation and hit compound optimization. The output of this 
process is a lead compound that goes into pre-clinical and, if validated, eventually into 
clinical trials. In this process, the screening phase is distinct fi-om the assay 
development phases, and involves testing compound efficacy in living biological 
systems. 

30 Historically, drug discovery is a slow and costiy process, spanning mmierous 

years and consuming hundreds of millions of dollars per drug created. Developments 
hi the areas of genomics and high throughput screening have resulted in uacreased 
capacity and efficiency in the areas of target identification and volume of compounds 
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HistoricaUy, drug discovery is a slow and costly process, spanning numerous 
years and consuming hundreds of miUions of dollars per drug created. Developments 
in the areas of genomics and high throughput screening have resulted in increased 
capacity and efficiency in the areas of target identification and volume of compounds 
5 screened. Significant advances in automated DNA sequencing, PGR application, 
positional cloning, hybridization arrays, and bioinformatics have greatly increased the 
number of genes (and gene fi-agments) encoding potential drug screening targets. 
However, the basic scheme for drug screening remians the same. 

VaUdation of genomic targets as points for therapeutic intervention using the 

10 existing methods and protocols has become a bottleneck in the drug discovery process 
due to the slow, manual methods employed, such as in vivo functional models, 
functional analysis of recombinant proteins, and stable cell Line expression of candidate 
genes. Primary DNA sequence data acquired through automated sequencing does riot 
permit identification of gene function, but can provide infomiation about conamon 

15 "motifs" and specific gene homology when compared to known sequence databases. 
Genomic methods such as subtraction hybridization and RADE (r^id amplification of 
differential expression) can be used to identify genes that are up or down regulated in a 
disease state model. However, identification and validation still proceed down the same 
pathway. Some proteomic methods use protein identification (global expression arrays, 

20 2D electrophoresis, combinatorial libraries) in combination with reverse genetics to 
identify candidate genes of interest. Such putative "disease associated sequences" or 
DAS isolated as intact cDNA are a great advantage to these methods, but they are 
identified by the hundreds without providing any information regarding type, activity, 
and distribution of the encoded protein. Choosing a subset of DAS as dmg screening 

25 targets is "random", and thus extremely inefficient, without functional data to provide a 
mechanistic link with disease. It is necessary, therefore, to provide new technologies to 
rapidly screen DAS to establish biological function, thereby improving target validation 
and candidate optimization in drag discovery. 

There are three major avenues for improving early drag discovery productivity. 

30 First, there is a need for tools that provide increased information handling capability. 
Bioinformatics has blossomed with the rapid development of DNA sequencing systems 
and the evolution of the genomics database. Genomics is beginning to play a critical 
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role in the identification of potential new targets. Proteomics has become indispensible 
in relating structure and function of protein targets in order to predict drug interactions. 
However, the next level of biological complexity is the cell. Therefore, there is a need 
to acquh-e, manage and search multi-dimensional information from cells. Secondly, 
5 there is a need for higher throughput tools. Automation is a key to improving 
productivity as has already been demonstrated in DNA sequencing and high throughput 
primary screening. The instant invention provides for automated systems that extract 
multiple parameter information from cells that meet the need for higher throughput 
tools. The instant invention also provides for miniaturizing the methods, thereby 
10 allowing increased throughput, while decreasing the volumes of reagents and test 
compoimds required in each assay. 

Radioactivity has been the dominant read-out in early drug discovery assays. 
However, the need for more information, higher throughput and miniaturization has 
caused a shift towards using fluorescence detection. Fluorescence-based reagents can 
15 yield more powerful, multiple parameter assays that are higher ia throughput and 
information content and require lower volumes of reagents and test compounds. 
Fluorescence is also safer and less expensive than radioactivity-based methods. 

Screening of cells treated with dyes and fluorescent reagents is well known in 
the art. There is a considerable body of literature related to genetic engineering of xells 
20 to produce fluorescent proteins, such as modified green fluorescent protein (GFP), as a 
reporter molecule. Some properties of wild-type GFP are disclosed by Morise et al. 
{Bioche77iistry 13 (1974), p. 2656-2662), and Ward et al. (Photochem, Photobiol 31 
(1980), p. 611-615). The GFP of the jellyfish Aequorea victoria has an excitation 
maximum at 395 ran and an raiission maximum at 510 nm, and does not require an 
25 exogenous factor for fluorescence activity. Uses for GFP disclosed in the hterature are 
widespread and include the study of gene expression and protein localization (Chalfie 
et al. Science 263 (1994), p. 12501-12504)), as a tool for visualizing subcellular 
organelles (Rizzuto et ah, Curr, Biology^ 5 (1995), p. 635-642)), visualization of protein 
transport along the secretory pathway (Kaether and Gerdes, FEES Letteis 369 (1995), 
30 p. 267-271)), expression in plant cells (Hu and Cheng, FEBS Letters 369 (1995), p. 
331-334)) and Drosophila embryos (Davis et al., Dev, Biology 170 (1995), p. 726- 
729)), and as a reporter molecule fused to another protein of interest (U. S. Patent 

3 
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5 491 084). Sinularly, W096/23898 relates to methods of detecting biologically active 
slbst^ces affecting intracellular processes by utilizing a GFP construct having a 
protein kinase activation site. This patent, and all other patents referenced in this 
application are incorporated by reference in their entirety . 
5 Numerous references are related to GFP proteins in biological systems. For 

example, WO 96/09598 describes a system for isolating cells of interest utilizing the 
expression of a GFP like protein. WO 96/27675 describes the expression of GFP in 
plants. WO 95/21191 describes modified GFP protein expressed m transformed 
organisms to detect mutagenesis. U. S. Patents 5,401,629 and 5.436,128 describe 
10 assays and compositions for detecting and evaluating the intracellular transduction of 
an extraceUular signal using recombinant ceUs that express cell surface receptors and 
contain reporter gene constmcts that include transcriptional regulatory elements that are 
responsive to the activity of cell surface receptors. 

Performing a screen on many thousands of compounds requires parallel 
15 handling and processing of many compounds and assay component reagents. Standard 
high throughput screens ("HTS") use mixtures of compounds and biological reagents 
along with some mdicator compound loaded into arrays of wells in standard microtiter 
plates with 96 or 384 wells. The signal measured from each well, eitha: fluorescence 
emission, optical density, or radioactivity, integrates the signal from all the material in 
20 the well giving an overall population average of all the molecules in the well. 

Science Applications International Corporation (SAIC) 130 Fifth Avenue, 
Seattle, WA. 98109) describes an imaging plate reader. This system uses a CCD 
camera to image the whole area of a 96 well plate. The image is analyzed to calculate 
the total fluorescence per well for all the material in the well. 
25 Molecular Devices, Inc. (Sunn3n'ale, CA) describes a system O^LIPR) which 

uses low angle laser scanning illumination and a mask to selectively excite fluorescence 
within approximately 200 microns of the bottoms of the wells in standard 96 well 
plates in order to reduce background when imaging cell monolayers. This system uses 
a CCD camera to image the whole area of the plate bottom. Although this system 
30 measures signals ori^ating from a cell monolayer at tiie bottom of the well, the signal 
measured is averaged over the area of the well and is therefore stUl considered a 
measurement of the average response of a population of cells. The image is analyzed to 

4 
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calculate the total fluorescence per well for cell-based assays. Fluid deUvery devices 
have also been incorporated into cell based screening systems, such as the FLIPR 
system, m order to mitiate a response, which is then observed as a whole well 
population average response using a macro-imaging system. 

5 In contrast to high throughput screens, various high-content screens ("HCS") 

have been developed to address the need for more detailed infonnation about the 
temporal-spatial dynamics of cell constituents and processes. High-content screens 
automate the extraction of multicolor fluorescence information derived from specific 
fluoTBScence-based reagents incorporated into cells (Giuliano and Taylor (1995), Curr. 

10 Op. Cell Biol. 7:4; Giuliano et al. (1995) Ann. Rev. Biophys. Biomol Struct. 24:405). 
Cells are analyzed using an optical system that can measure spatial, as well as temporal 
dynamics. (Farkas et al. (1993) Ann. Rev. Physiol. 55:785; GmUano et al. (1990) In 
Optical Microscopy for Biology. B. Herman and K. Jacobson (eds.), pp. 543-557. 
Wiley-Liss, New York; Hahn et al (1992) Nature 359:736; Waggoner et al. (1996) 

15 Hum. Pathol. 27:494). The concept is to treat each cell as a "well" that has spatial and 
temporal information on the activities of the labeled constituents. 

The types of biochemical and molecular information now accessible through 
fluorescence-based reagents applied to ce^s include ion concentrations, jnembrane 
potential, specific translocations, enzyme activities, gene expression, as well as the 

20 presence, amounts and patterns of metabolites, proteins, lipids, carbohydrates, and 
nucleic acid sequences (DeBiasio et al., (1996) Mol. Biol. Ce//..7:1259;GiuIiano et al., 
(1995) Ann. Rev. Biophys. Biomol. Struct. 24:405; Heun and Tsien, (1996) Curr. Biol 
6:178). 

High-content screens can be performed on either fixed cells, using fluorescently 
25 labeled antibodies, biological ligands, and/or nucleic acid hybridi2ation probes, or live 
cells using multicolor fluorescent indicators and "biosensors." The choice of fixed or 
live cell screens depends on the specific cell-based assay required. 

Fixed cell assays are the simplest, since an array of initially living cells in a 
microtiter plate format can be treated with various compounds and doses being tested, 
30 then the cells can be fixed, labeled with specific reagents, and measured. No 
environmental control of the cells is required after fixation. Spatial information is 
acquired, but only at one time point. The availability of thousands of antibodies, 

5 
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ligands and nucleic acid hybridization probes that can be applied to cells makes this an 
attractive approach for many types of cell-based screens. The fixation and labeling 
steps can be automated, allowing efficient processing of assays. 

Live cell assays are more sophisticated and powerful, since an array of living 
5 cells containing the desired reagents can be screwed over time, as well as space. 
Environmental control of the cells (temperature, humidity, and carbon dioxide) is 
required during measurement, since the physiological health of the cells must be 
maintained for multiple fluorescence measurements over time. There is a growing Ust 
of fluorescent physiological indicators and "biosensors" that can report changes in 
10 biochemical and molecular activities within cells (Giuhano et al., (1995) Ann. Rev. 
Biophys. BiomoL Struct. 24:405; Hahn et al., (1993) hi Fluorescent and Luminescent 
Probes for Biological Activity. W.T. Mason, (ed.), pp. 349-359, Academic Press, San 
Diego). 

The availabihty and use of fluorescence-based reagents has helped to advance 
15 the development of botii fixed and hve cell high-content screens. Advances in 
instrumentation to automatically extract multicolor, high-content infomiation has 
recently made it possible to develop HCS into an automated tool. An article by Taylor, 
et al. {American Scientist 80 (1992), p. 322-33^) describes many of these methods and 
their applications. For example, Proffitt et. al, {Cytometry 24: 204-213 (1996)) describe 
20 a semi-automated fluorescence digital imaging system for quantifying relative cell 
numbers in situ in a variety of tissue culture plate formats, especially 96-well microtiter 
plates. The system consists of an epifluorescence inverted microscope with a 
motorized stage, video camera, image intensifier, and a microcomputer with a PC- 
Vision digitizer. Turbo Pascal software controls the stage and scans the plate taking 
25 multiple images per well. The software calculates total fluorescence per weU, provides 
for daily calibration, and configures easily for a variety of tissue culture plate formats. 
Thresholding of digital images and reagents which fluoresce only when taken up by 
living cells are used to reduce background fluorescence without removing excess 
fluorescent reagent; 

Scanning confocal microscope imaging (Go et al., (1997) Analytical 
Biochemistry 247:210-215; Goldman et al., (1995) Expenmental Cell Research 
221:311-319) and multiphoton microscope imaging (Denk et al., (1990) Science 
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248:73; Gratton et al., (1994) Proc, of the. Microscopical Society of America, pp. 154- 
155) are also well established methods for acquiring high resolution images of 
microscopic samples. The principle advantage of these optical systems is the very 
shallow depth of focus, which allows features of limited axial extent to be resolved 
5 against the background. For example, it is possible to resolve internal cytoplasmic 
features of adherent cells from the features on the cell surface. Because scanning 
multiphoton imaging requires very short duration pulsed laser systems to achieve the 
high photon flux required, fluorescence lifetrmes can also be measured in these systems 
(Lakowicz et al,, (1992) Anal Biochern, 202:316-330; Gerrittsen et al. (1997), J. of 
10 Fluorescence 7:11-15)), providing additional capability for different detection modes. 
Small, reliable and relatively inexpensive laser systems, such as laser diode pumped 
lasers, are now available to allow midtiphoton confocal microscopy to be applied in a 
fairly routine fashion. 

A combination of the biological heterogeneity of cells in populations (Bright, et 
15 al., (1989). J. Cell Physiol 141:410; GiuUano, (1996) Cell Motil Cytoskel 35:237)) as 
well as the high spatial and temporal frequency of chemical and molecular information 
present within cells, makes it impossible to extract high-content information from 
populations of cells using existing whole microtiter plate readers. No existing high- 
content screening platform has been designed for multicolor, fluorescence-based 
20 screens using cells that are analyzed individually. Similarly, no method is currently 
available that combines automated fluid delivery to arrays of cells for the purpose of 
systematically screening compounds for the abihty to induce a cellular response that is 
identified by HCS analysis, especially from cells grovra. in microtiter plates. 
Furthermore, no method exists in the art combining high throughput well-by-well 
25 measurements to identify "hits" in one assay followed by a second high content cell-by- 
cell measurement on the same plate of only those wells identified as hits. 

The instant invention provides systems, methods, and screens that combine high 
throughput screening (HTS) and high content screening (HCS) that significantly 
improve target vahdation and candidate optimization by combining many cell screening 
30 formats with fluorescence-based molecular reagents and computer-based feature 
extraction, data analysis, and automation, resulting in increased quantity and speed of 
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data collection, shortened cycle times, and, ultimately, faster evaluation of promising 
drug candidates. The instant invention also provides for ininiatmizing the methods, 
thereby allowing increased throughput, while decreasing the volumes of reagents and 
test compounds required in each assay. 

SUMMARY OF THE IN VENTION 

In one aspect, the present invention relates to a method for analyzing cells 
comprising providing ceUs containing fluorescent reporter molecules in an array of 
locations, treating the ceUs in the may of locations - with one or more reagents, 
imaging numerous cells in each location with fluorescence optics, converting the 
optical infomiation into digital data, utiUzing the digital data to detennine the 
distribution, environment or activity of the fluorescently labeled reporter molecules in 
the cells and the distribution of the cells, aad interpreting that information in terms of a 
positive, negative or null effect of the compound being tested on the biological 
function 

In this embodiment, the method rapidly determines the distribution, 
environment, or activity of fluorescratly labeled reporter molecules in cells for the 
purpose of screening large numbers of compounds for^ those that specifically affect 
particular biological functions. The array of locations may be a microtiter plate or a 
microchip which is a microplate having cells in an array of locations. In a preferred 
embodiment, the method includes computerized means for acquiring, processing, 
displaying and storing the data received. In a preferred embodiment, the method 
further comprises automated, fluid dehvery to the arrays of cells. In another preferred 
embodiment, the information obtained from high throughput measurements on the 
same plate are used to selectively perform hi^ content screening on only a subset of 
the cell locations on the plate. 

In another aspect of the present invention, a cell screening system is provided 
that comprises: 

• a high magnification fluorescence optical system having a microscope 
objective. 
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. an XY stage adapted for holding a plate containing an aimy of cells and 
having a means for moving flie plate for proper alignment and focusing on 
the cell arrays; 

• a digital camera; 

5 • a light source having optical means for directmg excitation light to cell 

arrays and a means for directing fluorescent light emitted from the cells to 
the digital camera; and 

• a computer means for receiving and processing digital data from the digital 
camera wherein the computer means includes a digital frame grabber for 

10 receiving the images from the camera, a display for user mteraction and 

display of assay results, digital storage media for data storage and archiving, 
and a means for control, acquisition, processing and display of results. 

In a preferred embodiment, the cell screening systCTi ftirther comprises a 

15 computer screen operatively associated with the computer for displaying data. In 
another preferred embodiment, the computer means for receiving and processing digital 
data from the digital camera stores the data in a bioinformatics data base. In a fiorther 
preferred embodiment, the cell screening system further comprises a reader that 
measures a signal from many or all the weUs in parallel. In another preferred 

20 embodiment, the cell screening system further comprises a mechanical-optical means 
for changing the magnification of the system, to allow changing modes between hi^ 
throughput and high content screening. In another preferred embodiment, the cell 
screening system fturther comprises a chamber and control system to maintain the 
temperature, CO2 concentration and humidity surrounding the plate at levels required to 

25 keep cells alive. In a fiorther preferred embodiment, the cell screening system utilizes a 
confocal scanning illimiination and detection system. 

In another aspect of the present invention, a machine readable storage medium 
comprising a program containing a set of instructions for causing a cell screening 
system to execute procedures for defining the distribution and activity of specific 

30 cellular constituents and processes is provided. In a preferred embodiment, the cell 

screening system comprises a high magnification fluorescence optical system with a 

stage adapted for holding cells and a means for moving the stage, a digital camera, a 

9 
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light source for receiving and processing the digital data j&om the digital camera, and a 
computer means for receiving and processing the digital data jfrom the digital camera. 
Preferred embodiments of the machine readable storage medium comprise programs 
consisting of a set of instructions for causing a cell screening system to execute the 
procedures set forth in Figures 9, 11, 12, 13, 14 or 15. Another preferred embodiment 
comprises a program consisting of a set of instructions for causing a cell screening 
system to execute procedures for detecting the distribution and activity of specific 
cellular constituents and processes. In most preferred embodiments, the cellular 
processes include, but are not limited to, nuclear translocation of a protein, cellular 
hypertrophy, apoptosis, and protease-induced translocation of a protein. 

In another preferred embodiment, a variety of automated cell screening methods 
are provided, including screens to identify compounds that affect transcription factor 
activity, protein kinase activity, cell morphology, microtubule stracture, apoptosis,* 
receptor intemaHzation, and protease-induced translocation of a protein. 

In another aspect, the present invention provides recombinant nucleic acids 
encoding a protease biosensor, comprising: 

a. a first nucleic acid sequence that encodes at least one detectable 
polypeptide signal;- 

b. a second nucleic acid sequence that encodes at least one protease 
recognition site, wherein the second nucleic acid sequence is operatively linked to the 
first nucleic acid sequence that encodes the at least one detectable polypeptide signal; 
and 

c. a third nucleic acid sequence that encodes at least one reactant target 
sequence, wherein the third nucleic acid sequence is operatively linked to the second 
nucleic acid sequence that encodes the at least one protease recognition site. 

The present invention also provides the recombinant expression vectors capable 
of expressing the recombinant nucleic acids encoding protease biosensors, as well as 
genetically modified host ceUs that are transfected vdth the expression vectors. 

The invention further provides recombinant protease biosensors, comprising 

a. a first domain comprising at least one detectable polypeptide signal; 

b. a second domain comprising at least one protease recognition site; and 

c. a third domain comprising at least one reactant target sequence; 

10 
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wherein the first domain and the third domam are separated by the second 
domain. 

In a further aspect, the present invention involves assays and reagents for 
characterizing a sample for the presence of a toxin. The method comprises the use of 
5 detector, classifier, and . identifier classes of toxin biosensors to provide for various 
. levels of toxin characterization. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a diagram of the components of the cell-based scamiing system. 
10 Figure 2 shows a schematic of the microscope subassembly. 
Figure 3 shows the camera subassembly. 
Figure 4 illustrates cell scanning system process. 

Figure 5 illustrates a user interface showing major functions to guide the user. 
Figure 6 is a block diagram of the two platform architecture of the Dual Mode System 
15 for Cell Based Screening in which one platform uses a telescope lens to read all wells 
of a microtiter plate and a second platform that uses a higher magnification lens to read 
individual cells in a well. 

Figure 7 is a detail of an optical system for a single platform architecture of the Dual 
Mode System for Cell Based Screenmg that uses a moveable ^telescope' lens to read all 
20 wells of a microtiter plate and a moveable Ingher magnification lens to read individual 
cells in a well. 

Figure 8 is an illustration of the fluid delivery system for acquiring kinetic data on the 
Cell Based Screening System. 

Figure 9 is a flow chart of processing step for the cell-based scanning system. 
25 Figure 10 A- J illustrates the strategy of the Nuclear Translocation Assay. 

Figure 11 is a flow chart defining the processing steps in the Dual Mode System for 
Cell Based Screening combining high throughput and high content screening of 
microtiter plates. 

Figure 12 is a flow chart defining the processing steps in the High Throughput mode of 
30 the System for Cell Based Screening. 

Figure 13 is a flow chart defining the processing steps in the High Content mode of the 
System for Cell Based Screening. 

11 
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Figure 14 is a flow chart defining the processmg steps required for acquiring kinetic 
data in the High Content mode of the System for Cell Based Screening. 
Figure 15 is a flow chart defining the processing steps perfomied within a well during 
the acquisition of kinetic data. 

Figure 16 is an example of data firom a known inhibitor of translocation. 
Figure 17 is an example of data firom a known stimulator of translocation. 
Figure 18 illustrates data presentation on a graphical display. 

Figure 19 is an illustration of the data from the High Throughput mode of the System 
for Cell Based Screening, an example of the data passed to the High Content mode, the 
data acquired in the high content mode, and the results of the analysis of that data. 
Figure 20 shows the measurement of a dmg-induced cytoplasm to nuclear 
translocation. 

Figure 21 illustrates a graphical user interface of the measurement shown in Figure 20. 
Figure 22 illustrates a graphical user interface, with data, presentation, of the 
measurement shown in Fig. 20. 

Figure 23 is a graph representing the kinetic data obtained fi-om the measurements 

depicted in Fig. 20. * ^ 

Figure 24 details a high-content screen of drug-induced apoptosis. ^ 

Figure 25. Graphs depicting changes in morphology upon induction of apoptosis. 

Staurosporine (A) and pacUtaxel (B) induce classic nuclear firagmentation in L929 ceUs. 

BBDBC cells exhibit concentration dependent changes in response to staurosporine (C), 

but a more classical response to paclitaxel (D). MCF-7 cells exhibit either nuclear 

condensation (E) or firagmentation (F) in response to staurosporine and paclitaxel, 

respectively. In all' cases, cells were exposed to the compoimds for 30 hours. 

Figure 26 illustrates the dose response of cells to staurosporine in terms of both nuclear 

size and nuclear perimeter convolution. 

Figure 27. Graphs depicting induction of apoptosis by staurosporine and paclitaxel 
leading to changes in peri-nuclear f-actin content. (A, B) Both apoptotic stimulators 
induce dose-dependent increases in f-actin content in L929 cells. (C) In BHK cells, 
staurosporine induces a dose-dependent increase in f-actin, whereas paclitaxel (D) 
produces results that are more variable. (E) MCF-7 cells exhibit either a decrease or 
increase depending on the concentration of staurosporine. (F) Paclitaxel induced 
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Changes in f-actin content were highly variable and not significant. Cells were exposed 
to the compounds for 30 hours. 

Figure 28. Graphs depicting mitochondrial changes in response to induction of 
apoptosis. L929 (A3) and BHK (C,D) cells responded to both staurosporine (A,C) and 

5 paclitaxel (B,D) with increases in mitochondrial mass. MCF-7 cells exhibit either a 
decrease in membrane potential (E. staurosporine) or an increase in mitochondrial mass 
(F, paclitaxel) depending on the stimulus. Cells were exposed to the compounds for 30 
hours. 28G is a graph showing the simultaneous measurement of staurosporine effects 
on mitochondrial mass and mitochondrial potential in BHK cells. 

10 Figure 29 shows the nucleic acid and amino acid sequence for various types of 
protesae biosensor domains. (A) Signal sequences. (B) Protease recognition sites. (C) 
Product/Reactant target sequences 

Figure 30 shows schematically shows some basic organization of domains in the 
^jrotease biosensors of the invention. 
15 Figure 31 is a schematic diagram of a specific 3-domain protease biosensor. 

Figure 32 is a photograph showing the effect of stimulation of apoptosis by cis-platin 
oh BHK cells transfected with an expression vector that expresses the caspase 
biosensor shown in Figure 32. 

Figure 33 is a schematic diagram of a specific 4-domain protease biosensor. 
20 Figure 34 is a schonatic diagram of a specific 4-domain protease biosensor, containing 
a nucleolar localization signal. 

Figure 35 is a schematic diagram of a specific 5-domain protease biosensor. 
Figure 36 shows the differential response in a dual labeling assay of the p38 MAPK 
and NF-kB pathways across three model toxins and two different cell types. 
25 Treatments marked with an asterisk are different from controls at a 99% confidence 
level (p < 0.01). 

nyTAn.TCD nyscRiPTiON ofthf tnvention 

All cited patents, patent appUcations and other references are hereby 
30 incorporated by reference in their entirety. 

As used horein, the following terms have the specified meaning: 
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Markers of cellular domains. Luminescent probes that have high aEBnity for 
specific cellular constituents including specific organelles or molecules. These probes 
can either be small luminescent molecules or fluorescently tagged macromolecules 
used as "labeling reagents", "envu-onmental indicators", or "biosensors." 

5 Labeling reagents. Labeling reagents include, but are not linaited to, 

luminescently labeled macromolecules including fluoiescent protein analogs and 
biosensors, luminescent macromolecular chimeras mcluding those formed with the 
green fluorescent protein and mutants thereof luminescently labeled primary or 
secondary antibodies that react with cellular antigens involved in a physiological 

10 response, luminescent stains, dyes, and other small molecules. 

Markers of cellular translocations. Luminescently tagged macromolecules or 
organelles that move from one cell domain to another during some cellular process or 
physiologicar response. Translocation markers can either simply report location 
relative to the markers of cellular domams or they can also be "biosensors" ttiat report 

15 some biochemical or molecular activity as well. 

Biosensors. Macromolecules consistmg of a biological functional domain and a 
luminescent probe or probes that report the environmaital changes that occur either 
internally or on their surfece. A class of luminescently labeled macromolecules 
designed to sense and report these changes have been termed "fluoresceait-protein 

20 biosensors". The protein component of the biosaisor provides a highly evolved 
molecular recognition moiety. A fluorescent molecule attached to the protein 
component in the proximity of an active site transduces environmeaital changes into 
fluorescence signals that are dfetected using a system witii an appropriate temporal and 
spatial resolution such as the cell scanning system of ttie present invention. Because 

25 the modulation of native protein activity within the living cell is reversible, and because 
fluorescent-protein biosensors can be designed to sense reversible changes in protein 
activity, these biosensors are essentially reusable. 

Disease associated sequences ("DAS"). This term refers to nucleic acid 
sequences identified by standard techniques, such as primary DNA sequence data, 

30 genomic methods such as subtraction hybridization and RADE, and proteomic methods 
in combination with reverse genetics, as being of drug candidate compounds. The term 
does not mean that the sequence is only associated with a disease state. 

14 



BNSDOCIDkWO 00&0872A3 IA> 



wo 00/50872 PCT/USOO/04794 

High content screening (HCS) can be used to measure the effects of drugs on 
complex molecular events such as signal transduction pathways, as well as cell 
functions including, but not limited to, apoptosis, cell division, cell adhesion, 
locomotion, exocytosis, and cell-cell communication. Multicolor fluorescence permits 

5 multiple targets and cell processes to be assayed in a single screen. Cross-correlation 
of cellular responses will yield a wealth of information required for target validation 
and lead optimization. 

In one aspect of the present invention, a cell screening system is provided 
comprising a high magnification fluorescence optical system having a microscope 

10 objective, an XY stage adapted for holding a plate with an array of locations for 
holding cells and having a means for moving the plate to align the locations with the 
microscope objective and a means for moving the plate in the direction to effect 
focusing; a digital camera; a light source having optical means for directing excitation 
light to cells in the array of locations and a means for directing fluorescent light emitted 

15 from the cells to the digital camera; and a computer means for receiving and processing 
digital data from the digital camera wherein the computer means includes: a digital 
frame grabber for receiving the images from the camera, a display for user interaction 
and display of assay results, digital storage media for data storage and archiving, and 
means for control, acquisition, processing and display of results. / 

20 Figure 1 is a schematic diagram of a preferred embodiment of the cell scanning 

system. An inverted fluorescence microscope is used 1, such as a Zeiss Axiovert 
inverted fluorescence microscope >yhich uses standard objectives with magnification of 
1-1 OOx to the camera, and a white light source (e.g. lOOW mercury-arc lamp or 75 W 
xenon lamp) with power supply 2. There is an XY stage 3 to move the plate 4 in the 

25 XY direction over the microscope objective. A Z-axis focus drive 5 moves the 
objective in the Z direction for focusing, A joystick 6 provides for manual movement 
of the stage in the XYZ direction. A high resolution digital camera 7 acquires images 
from each well or location on the plate. There is a camera power supply 8^ an 
automation controller 9 and a central processing unit 10. The PG JLL provides a display 

30 12 and has associated software. The printer 13. provides for printing of a hard copy 
record. 

15 
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Figure 2 is a schematic of one embodiment of the microscope assembly 1 of flie 
invention, showing in more detail the XY stage 3, Z-axis focus drive 5, joystick £ light 
source 2, and automation controller £. Cables to the computer H and microscope 14 
respectively, are provided. In addition, Figure 2 shows a 96 well microtiter plate H 

5 which is moved on the XY stage 1 in the XY direction. Light from the light source 2 
passes through the PC controlled shutter 18 to a motorized filter wheel 19 with 
excitation filters 20. The light passes into filter cube 25 which has a dichroic mirror 26 
and an emission filter 22- Excitation light reflects off the dichroic minor to the wells in 
the microtiter plate 12 and fluorescent light 28 passes throu^ the dichroic mirror 26 

10 and the emission filter 27 and to the digital camera 7. 

Figure 3 shows a schematic drawing of a preferred camera assembly. The 
digital camera 7, which contains an automatic shutter for exposure control and a power 
supply 3L receives fluorescent light 28 fiom the microscope assembly. A digital cable 
30 transports digital signals to the computer. 

15 The standard optical configurations described above use microscope optics to 

directly produce an enlarged image of the specimen on the camera sensor in order to 
capture a high resolution image of the specimen. This optical system is commonly 
referred to as 'wide field' microscopy. Those skilled in the art of microscopy will, 
recognize that a high resolution image of the specimen can be created by a variety of 

20 other optical systems, including, but not Imiited to, standard scanning confocal 
detection of a focused point or line of illumination scanned over the specimen (Go et al. 
1997, supra), and multi-photon scanning confocal microscopy (Denk et al., 1990, 
supra), both of which can form images on a CCD detector or by synchronous 
digitization of the analog output of a photomultiplier tube. 

25 In screening applications, it is often necessary to use a particular cell line, or 

primary cell culture, to take advantage of particular features of those cells. Those 
skilled in the art of cell culture will recognize that some cell lines are contact inhibited, 
meaning that they will stop growing when they become surrounded by other cells, 
while other cell lines will continue to grow under those conditions and the cells will 

30 literally pile up, forming many layers. An example of such a cell line is the HEK 293 
(ATCC CRL-1573) line. An optical system that can acquire images of single cell 
layers in multilayer preparations is requured for use with cell lines that tend to form 
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layers. The large depth of field of wide field microscopes produces an image that is a 
projection through the many layers of ceUs, maldng analysis of subcellular spatial 
distributions extremely difficult in layer-foraiing cells. Alternatively, the very shallow 
depth of field that can be achieved on a confocal microscope, (about one micron), 

5 allows discrimination of a single cell layer at high resolution, simpUfymg the 
determination of the subcellular spatial distribution. Sunilarly, confocal inaaging is 
preferable when detection modes such as fluorescence lifetime imaging are required. ^ 
The output of a standard confocal imaging attachment for a microscope is a 
digital image that can be converted to the same format as the images produced by the 

10 other cell screening system embodiments described above, and can therefore be 
processed in exactly the same way as those images. The overall control, acquisition 
and analysis in this embodiment is essentially the same. The optical configuration of 
the confocal microscope system, is essentially the same as that described above, except 
for the illuminator and detectors. Illumination and detection systems required for 

15 confocal microscopy have been designed as accessories to be attached to standard 
microscope optical systems such as that of the present invention (Zeiss, Germany). 
These alternative optical systems therefore can be easily integrated into the system as 
described above. 

Figure 4 illustrates an alternative embodiment of the invention in which cell 
20 arrays are in microwells 40 on a microplate 4L described ion co-pending U.S. 
Application S/N 08/865,341, incorporated by- reference herein in its entirety. Typically 
the microplate is 20 mm by 30 mm as compared to a standard 96 well microtiter plate 
which is 86 mm by 129 mm. The higher density array of cells on a microplate allows 
the microplate to be imaged at a low resolution of a few microns per pbcel for high 
25 throughput and particular locations on the microplate to be imaged at a higher 
resolution of less than 0.5 microns per pixel. These two resolution modes help to 
improve the overall throughput of the system. - 

. The microplate chamber 42 serves as a microfluidic delivery system for the 
addition of compounds to cells. The microplate 41 in the microplate chamber 42 is 
30 placed in an XY microplate reader 43. Digital data is processed as described above. 
The small size of this microplate system increases throughput, minimizes reagent 
volume and allows control of the distribution and placement of cells for fast and precise 
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cell-based analysis. Processed data can be displayed on a PC screen H and made part 
of a bioinformatics data base 44. This data base not only permits storage and retrieval 
of data obtained through the methods of this invention, but also permits acquisition and 
storage of external data relating to cells. Figure 5 is a PC display which illustrates the 

5 operation of the software. 

In an alternative embodiment, a high throughput system (HTS) is directly 
coupled with the HCS either on the same platfomi or on two separate platforms 
connected electronically (e.g. via a local area network). This embodiment of the 
invention, referred to as a dual mode optical systmi, has the advantage of increasing the 

10 throughput of a HCS by coupling it with a HTS and thereby requiring slower high 
resolution data acquisition and analysis only on the small subset of wells that show a 
response in the coupled HTS. 

High throughput 'whole plate' reader systems are well known in the art and are 
commonly used as a component of an HTS system used to screen large munbers- of 

15 compounds (Beggs (1997), J. ofBiomolec. Screening 2:71-78; Macaffrey et al., (1996) 
J. Biomolec. Screening 1:1 87-190). 

In one embodiment of dual mode cell based screening, a two platform 
architecture in which high throughput acquisition occurs on one platform and high 
content acquisition occurs on a second platform is provided (Figure 6). Processing 

20 occurs on each platfonn independently, with results passed over a network interface, or 
a single, controller is used to process the data from both platforms. 

As illustrated in Figure 6, an exemplified two platfomi dual mode optical 
system consists of two light optical instraments, a high throughput platfonn 60 and a 
high content platform 65^ which read fluorescent signals emitted from cells cultured in 

25 microtiter plates or microwell arrays on a microplate, and communicate with each other 
via an electronic connection 64. The high throughput platfonn 60 analyzes all the wells 
in the whole plate either in parallel or rapid serial fashion. Those skilled in the art of 
screening will recognize that there are a many such commercially available high 
throughput reader systems that could be integrated into a dual mode cell based 

30 screening system (Topcount (Packard Instruments, Meriden, CT); Spectramax, 

Lumiskan (Molecular Devices, Sunnyvale, CA); Fluoroscan * (Labsystems, Beverly, 

MA)). The high content platform 65, as described above, scans from well to well and 
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acquires and analyzes high resolution image data collected from individual cells within 
a well. 

The HTS software, residing on the system's computer 62, controls the high 
throughput instmment, and results are displayed on the monitor 61. The HCS software, 
5 residing on it's computer system &L controls the high content instrument hardware 65, 
optional devices (e.g. plate loader, enviromnental chamber, fluid dispenser), analyzes 
digital image data from the plate, displays results on the monitor 6^ and manages data 
measured in an integrated database. The two systems can also share a single computer, 
in which case all data would be collected, processed and displayed on that computer, 
10 without the need for a local area network to transfer the data. Microtiter plates are 
transferred from the high throughput system to the high content system 63 either 
manually or by a robotic plate transfer device, as is well known in the art (Beggs 
(1997), jTupra; Mcaffrey (1996), supra). 

In a preferred embodiment, the dual mode optical system utilizes a single 
15 platform system (Figure 7). It consists of two separate optical modules, an HCS 
module 202 and an HTS module 209 that can be independently or coUectively moved 
so that only one at a time is used to collect data from the microtiter plate 201. The 
microtiter plate 201 is mounted in a motorized X,Y stage so it can be positioned for 
imaging in either HTS or HCS mode. After collecting and analyzing the HTS image 
20 data as described below, the HTS optical module 209 is moved out of tiie optical patii 
and the HCS optical module 203 is moved into place. 

The optical module for HTS 209 consists of a projection lens 2H. excitation 
wavelength filter 213 and dichroic mirror 210 which are used to illuminate tiae whole 
bottom of the plate with a specific wavelength band from a conventional microscope 
25 lamp system (not iUustrated). The fluorescence emission is collected through the 
dichroic mirror 210 and emission wavelength filter 2U by a lens 212 which forms an 
image on the camera 216 wifli sensor 215 . 

The optical module for HCS 2Q3 consists of a projection lens 208, excitation 
wavelength filter 207 and dichroic mirror 204 which are used to illuminate tiie back 
30 aperture of the microscope objective 202, and thereby the field of that objective, from a 
standard microscope illumination system (not shown). The fluorescence emission is 
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collected by the microscope objective 202, passes through the dichroic mirror 204 and 
emission wavelength filter 205 and is focused by a tube lens 206 which forms an image 
on the same camera 216 with sensor 215 . 

In an alternative embodiment of the present invention, the cell screening system 

5 further comprises a fluid delivery device for use with the live cell embodiment of the 
method of cell screening (see below). Figure 8 exemplifies a fluid delivery device for 
use with the system of the invention. It consists of a bank of 12 syringe pumps 701 
driven by a single motor drive. Each syringe 702 is sized according to the volume to be 
delivered to each well, typically between 1 and 100 ixL, Each syringe is attached via 

10 flexible tubing 7Q3 to a similar bank of comiectors which accept standard pipette tips 
705 . The bank of pipette tips are attached to a drive system so they can be lowered and 
raised relative to the microtiter plate 706 to deliver fluid to each well. The plate is 
mounted on an XY stage, allowing movement relative to the optical system 2Q2 for 
data collection purposes. This set-up allows one set of pipette tips, or even a single 

15 pipette tip, to deliver reagent to all the wells on the plate. The bank of syringe pumps 
can be used to deliver fluid to 12 wells simultaneously, or to fewer wells by removing 
some of the tips. 

In another aspect, the present invention provides a method for analyzing cells 
comprising providing an array of locations w^ich contain multiple cells wherein the 
20 cells contain one or more fluorescent reporter molecules; scanning multiple cells in 
each of the locations containing cells to obtain fluorescent signals fix)m the fluorescent 
reporter molecule in the cells; converting the fluorescent signals into digital data; and 
utihzing the digital data to determine the distribution, environment or activity of the 
fluorescent reporter molecule within the cells. 

25 

Cell Arrays 

Screening large numbers of compounds for activity with respect to a particular 
biological function requires preparing arrays of cells for parallel handling of cells and 
reagents. Standard 96 well microtiter plates which are 86 mm by 129 mm, with 6mm 
30 diameter wells on a 9mm pitch, are used for compatibility with current automated 
loading and robotic handling systems. The microplate is typically 20 mm by 30 mm, 
with cell locations' that are 100-200 microns in dimension on a pitch of about 500 
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microns. Methods for making microplates are described in U.S. Patent Application 
Serial No. 08/865,341, incorporated by reference herein in its entirety. Microplates 
may consist of coplanar layers of materials to which cells adhere, patterned with 
materials to which cells will not adhere, or etched S-dimensional surfaces of similarly 

5 pattered materials. For the purpose of the following discussion, the terms 'well* and 
'microweir refer to a location in an array of any construction to which cells adhere and 
within which the cells are imaged, Microplates may also include fluid delivery 
channels in the spaces between the wells. The smaller format of a microplate increases 
the overall efficiency of the system by minimizmg the quantities of the reagents, 

10 storage and handling during preparation and the overall movement required for the 
scanning operation. In addition, the whole area of the microplate can be imaged more 
efficiently, allowing a second mode of operation for the microplate reader as described 
later in this document. 
Fluorescence Reporter Molecules 

15 A major component of the new drug discovery paradigm is a continually 

growing family of fluorescent and luminescent reagents that are used to measure the 
temporal and spatial distribution, content, and activity of intracellular ions, metabolites, 
macromolecules, and organelles. Classes of these reagents include labeling reagents 
that measure the distribution and amount of molepules in living and fixed cells, 

20 environmental indicators to report signal transduction events in time and space, and 
fluorescent protein biosensors to measure target molecular activities within living cells. 
A multiparameter approach that combines several reagents in a single cell is a powerfiil 
new tool for dmg discovery. 

The method of the present invention is based on the high affinity of fluorescent 

25 or luminescent molecules for specific cellular components. The affinity for specific 
components is governed by physical forces such as ionic interactions, covalent bonding 
(which includes chimeric fusion with protein-based chromophores, fluorophores, and 
lumiphores), as well as hydrophobic interactions, electrical potential, and, in some 
cases, simple entrapment within a cellular component The luminescent probes can be 

30 small molecules, labeled macromolecules, or genetically engineered proteins, 
including, but not limited to green fluorescent protein chimeras. 
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Those skiUed in this art wiU recognize a wide variety of fluorescent reporter 
molecules that can be used in the present invention, including, but not limited to, 
fluorescently labeled biomolecules such as proteins, phosphoUpids and DNA 
hybridizing probes. Similarly, fluorescent reagents specifically synthesized with 
particular chemical properties of binding or association have been used as fluorescent 
reporter molecules (Barak et al., (1997), J. Biol. Chem. 272:27497-27500; Southwick et 
al.. (1990). Cytometry 11:418-430; Tsien (1989) in Methods in Cell Biology, Vol. 29 
Taylor and Wang (eds.), pp. 127-156). Fluorescently labeled antibodies are particularly 
useful reporter molecules due to their high degree of specificity for attaching to a single 
molecular target in a mixture of molecules as complex as a cell or tissue. 

The luminescent probes can be synthesized within the living cell or can be 
transported into the cell via several non-mechanical modes including diffusion, 
facilitated or active transport, signal-sequence-mediated transport,, and endocytotic or 
pinocytotic uptake. Mechanical bulk loading methods, which are well known in the art, 
can also be used to load luminescent probes into living cells (Barber et al. (1996), 
Neuroscience Letters 207:17-20; Bright et al. (1996), Cytometry lA-Jne-llZ', McNeil 
(1989) in Methods in Cell Biology, Vol. 29, Taylor and Wang (eds.), pp. 153-173). 
These methods include electroporation and other mechanical methods such as scrape- 
loading, bead-loading, impact-loading, syringe-loading, hypertonic and hypotonic 
loading. Additionally, cells can be genetically engineered to express reporter 
molecules, such as GFP, coupled to a protein of interest as previously described 
(Chalfie and Prasher U.S. Patent No. 5.491,084; Gubitt et al. (1995), Trends in 
Biochemical Science 20:448-455). 

Once in the cell, the luminescent probes accumulate at their target domain as a 
result of specific and high affinity mteractions with the target domain or other modes of 
molecular targeting such as signal-sequence-mediated transport. Fluorescently labeled 
reporter molecules are usefiil for determinmg the location, amount and chemical 
environment of the reporter. For example, whether the reporter is in a lipophilic 
membrane environment or in a more aqueous environment can be determined (Giuliano 
et al. (1995), Ann. Rev. of Biophysics and Biomolecular Structure 24:405-434; Giuliano 
and Taylor (1995), Methods in Neuroscience 27:1-16). The pH environment of the 

reporter can be deteraiined (Bright et al. (1989), J. Cell Biology 104:1019-1033; 
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Giuliano et al. (1987). Anal. Biochem. 167:362-371; Thomas et al. (1979). 
Biochemistry 18:2210-2218). It can be determined whether a reporter having a 
chelating group is bound to an ion, such as Ca-H-. or not (Bright et al. (1989), In 
Methods in Cell Biology, Vol. 30, Taylor and Wang (eds.). pp. 157-192; Shimoura et al. 
5 (1988), J. of Biochemistry' (Tokyo) 251:405-410; Tsien (1989) In Methods in Cell 
Biology, Vol. 30, Taylor and Wang (eds.), pp. 127-156). 

Furthermore, certain cell types within an organism may contain components 
that can be specificaUy labeled thkt may not occur in other ceU types. For example, 
epithelial cells often contain polarized membrane components. That is, these cells 
10 asymmetricaUy distribute macromolecules along their plasma membrane. Connective 
or supporting tissue cells often contain granules in which are trapped molecules specific 
to that cell type (e.g., heparin, histamine, serotonin, etc.). Most muscular tissue cells 
contain a sarcoplasmic reticulum, a specialized organeUe whose fimction is to regulate 
the concentration of calcium ions within the cell cytoplasm. Many nervous tissue cells 
15 contain secretory granules and vesicles in which are trapped neurohormones or 
neurotransmitters. Therefore, fluorescent molecules can be designed to label not only 
specific components within specific cells, but also specific cells within a population of 
mixed cell types. 

Those skilled in the art will recognize a wide variety of ways to measure 
20 fluorescence. For example, some fluorescent reporter molecules exhibit a change in 
excitation or emission spectra, some exhibit resonance energy transfer where one 
fluorescent reporter loses fluorescence, while a second gains in fluorescence, some 
exhibit a loss (quenching) or appearance of fluorescence, while some report rotational 
movements (Giuliano et al. (1995), Ann. Rev. of Biophysics and Biomol Structure 
25 24:405-434; Giuliano et al. (1995), Methods in Neuroscience 27:1-16). 
Scanning cell arrays 

Referring to Figure 9, a preferred embodiment is provided to analyze cells that 
comprises operator-directed parameters being selected based on the assay being 
conducted, data acquisition by the cell screening system on the distribution of 
30 fluorescent signals within a sample, and interactive data review and analysis. At the 
start of an automated scan the operator enters information 100 that describes tiie 
sample specifies the filter settings and fluorescent channels to match the biological 
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labels being used and the information sought, and then adjusts the camera settings to 
match the sample brightness. For flexibility to handle a range of samples, the software 
allows selection of various parameter settings used to identify nuclei and cytoplasm, 
and selection of different fluorescent reagents, identification of cells of interest based 

5 on morphology or brightness, and cell numbers to be analyzed per well. These 
parameters are stored in the system's for easy retrieval for each automated run. The 
system's interactive cell identification mode simplifies the selection of morphological 
parameter limits such as the range of size, shape, and intensity of cells to be analyzed. 
The user specifies which wells of the plate the system will scan and how many fields or 

10 how many cells to analyze in each well. Depending on the setup mode selected by the 
user at step lOJL the system either automatically pre-focuses the region of the plate to 
be scanned using an autofocus procedure to "find focus" of the plate 102 or the user 
interactively pre-focuses 103 the scanning region by selecting three "tag" points which 
define the rectangular area to be scanned. A least-squares, fit "focal plane model" is 

15 then calculated fi-om these tag points to estimate the focus of each well during an 
automated scan. The focus of each well is estimated by interpolating firom the focal 
plane model during a scan. 

During an automated scan, the software dynamically displays the scan status, 
including the number of cells analyzed, the current well being analyzed, images of each 

20 independent wavelength as they are acquired, and the result of the screen for each well 
as it is determined. The plate 4 (Figure 1) is scanned in a serpentine style as the 
software automatically moves the motorized microscope XY stage 3 fi-om well to well 
and field to field within each well of a 96-well plate. Those skilled in the programming 
art will recognize how to adapt software for scanning of other microplate formats such 

25 as 24, 48, and 384 well plates. The scan pattem of the entire plate as well as the scan 
pattern of fields within each well are programmed. The system adjusts sample focus 
with an autofocus procedm-e 1Q4 (Figure 9) through the Z axis focus drive 5, controls 
filter selection via a motorized filter wheel 19^ and acquires and analyzes images of up 
to four different colors ("channels" or * Vavelengths"), 

30 The autofocus procedure is called at a user selected firequency, typically for the 

first field in each well and then once every 4 to 5 fields within each well. The autofocus 
procedure calculates the starting Z-axis point by interpolating fi-om the pre-calculated 
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plane focal model. Starting a programmable distance above or below this set point, the 
procedure moves the mechanical Z-axis through a number of different positions, 
acquires an image at each position, and finds the maximum of a calculated focus score 
that estimates the contrast of each image. The Z position of the unage with the 

5 maximum focus score determines the best focus for a particular field. Those skilled in 
the art will recognize this as a variant of automatic focusing methods as described in 
Harms et al. in Cytometry 5 (1984), 236-243, Groen et al. in Cytometry 6 (1985). 81-91, 
and Firestone et al. in Cytometry 12 (1991), 195-206. 

For image acquisition, the camera's exposure time is separately adjusted for 

1 0 each dye to ensure a high-quality image from each channel. Software procedures can be 
called, at the user's option, to correct for registration shifts between wavelengths by 
accounting for linear (X and Y) shifts between wavelengths before making any further 
measurements. The electronic shutter 18 is controlled so that sample photo-bleaching is 
kept to a minimum. Background shading and uneven illumination can be corrected by 

15 the software using methods known in the art ^Bright et al. (1987), J. Cell Biol. 
104:1019-1033). 

In one channel, images are acquired of a primary marker 105 (Figure 9) 
(typically cell nuclei counterstained with DAPI or PI fluorescent dyes) which are 
segmented ("identified") using an adaptive thresholding procedure. The adaptive 

20 thresholding procedure IM is used to dynamically select the threshold of an image for 
separating cells from the background. The staining of ceUs with fluorescent dyes can 
vary to an tmknown degree across cells in a microtiter plate sample as well as within 
images of a field of cells within each well of a microtiter plate. This variation can occur 
as a result of sample preparation and/or the dynamic nature of cells. A global threshold 

25 is calculated for the complete unage to separate the cells from background and account 
for field to field variation. These global adaptive techniques are variants of those 
described in the art. (Kittler et al. in Computer Vision, Graphics, and Image 
Processing 30 (1985), 125-147, Ridler et al. in IEEE Trans. Systems. Man. and 
Cybernetics (1 978), 630-632.) 

30 An alternative adaptive thresholding method utilizes local region thresholding 

in contrast to global image thresholding. - Image analysis of local regions leads to better 
overall segmentation since staining of cell nuclei (as well as other labeled components) 
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can vary across an image. Using this globamocal procedure, a reduced resolution 
image (reduced in size by a factor of 2 to 4) is first globally segmented (using adaptive 
thresholding) to find regions of interest in the image. These regions then serve as 
guides to more fiilly analyze the same regions at full resolution. A more localized 
threshold is then calculated (again using adaptive thresholding) for each region of 
interest. 

The output of the segmentation procedure is a binary image wherein the objects 
are white and the background is black. This binary image, also called a mask in the art, 
is used to determine if the field contains objects m The mask is labeled with a blob 
labeling method whereby each object (or blob) has a unique number assigned to it. 
Morphological features, such as area and shape, of the blobs are used to differentiate 
blobs likely to be cells from those that are considered artifacts. The user pre-sets the 
morphological selection criteria by either typing in known cell morphological features 
or by using the interactive training utility. If objects of interest are found in the field, 
images are acquired for all other active channels 108, otherwise the stage is advanced 
to the next field 109 in the current well. Each object of interest is located m the image 
for further analysis 110. The software determines if the object meets the criteria for a 
valid cell nucleus HI by measuring its morphological features (size and shape). For 
each valid cell, the XYZ stage location: is recorded, a small image of the cell is stored, 
and features are measured 112 . 

The cell scanning method of the present invention can be used to perform many 
different assays on cellular samples by applying a numiber of analytical methods 
simultaneously to measure features at multiple wavelengths. An example of one such 
assay provides for the following measurements: 

1. The total fluorescent intensity within the cell nucleus for colors 1-4 

2. The area of the cell nucleus for color 1 (the primary marker) 

3. The shape of the cell nucleus for color 1 is described by tturee shape 
features: 

a) perimeter squared area 

b) box area ratio j 

c) height width ratio 

4. The average fluorescent intensity within the cell nucleus for colors 1-4 (i.e. 
#1 divided by #2) 

5. The total fluorescent intensity of a ring outside the nucleus (see Figure 10) 
that represents fluorescence of the cell's cytoplasm (cytoplasmic mask) for 
colors 2-4 
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6. The area ofthe cytoplasmic mask , o >. 

7. The average fluorescent intensity of the cytoplasmic mask for colors 2-4 
(i.e. #5 divided by #6) . 

8 The ratio of the average fluorescent intensity of the cytoplasmic mask to 
average fluorescent intensity within the cell nucleus for colors 2-4 (i.e. #7 

dividedby#4) . ^ , • i 

9 The difference of the average fluorescent intensity of the cytoplasmic mask 
and the average fluorescent intensity within the cell nucleus for colors 2-4 
(i.e. #7 minus #4) . 

10. The number of fluorescent domains (also call spots, dots, or grains) wittvm 
the cell nucleus for colors 2-4 



Features 1 through 4 are general features ofthe different cell screening assays 
of the invention. These steps are commonly used in a variety of image analysis 
applications and are well known in art (Ross (1992) The Image Processing Handbook, 
CRC Press Inc.; Gonzales et al. (1987), Digital Image Processing. Addison-Wesley 
Publishing Co. pp. 391-448). Features 5-9 have been developed specifically to provide 
measurements of a cell's fluorescent molecules within the local cytoplasmic region of 
the cell and the translocation (i.e. movement) of fluorescent molecules firom the 
20 cytoplasm to the nucleus. These features (steps 5-9) are used for analyzing cells in 
microplates for the inhibition of nuclear translocation. For example, inhibition of 
nuclear translocation of transcription factors provides a novel approach to screening 
intact cells (detailed examples of other types of screens wiU be provided below). A 
specific method measures the amount of probe ui the nuclear region (feature 4) versus 
25 the local cytoplasmic region (feature 7) of each cell. Quantification of the difference 
between these two sub-cellular compartments provides a measure of cytoplasm-nuclear 
translocation (feature 9). 

Feature 10 describes a screen used for coimting of DNA or RNA probes within 
the nuclear region m colors 2-4. For example, probes are commercially available for 
30 identifying chromosome-specific DNA sequences (Life Technologies, Gaithersburg, 
MD; Genosys, Woodlands, TX; Biotechnologies, Lie, Richmond, CA; Bio 101, Inc.. 
Vista, CA) Cells are three-dimensional in nature and when examined at a high 
magnification under a microscope one probe may be in-focus while another may be 
completely out-of-focus. The cell screening method of the present invention provides 
35 for detecting three-dimensional probes in nuclei by acquiring images firom multiple 
focal planes. The software moves the Z-axis motor drive i (Figure 1) in small steps 
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where the step distance is user selected to account for a wide range of different nuclear 
diameters. At each of the focal steps, an image is acquired. The maximum gray-level 
intensity from each pixel in each image is found and stored in a resulting maximum 
projection image. The maximum projection image is then used to count the probes. The 

5 above method works well in counting probes that are not stacked directly above or 
below another one. To account for probes stacked on top of each other in the Z- 
direction, users can select an option to analyze probes in each of the focal planes 
acquired. In this mode, the scanning system performs the maxhnum plane projection 
method as discussed above, detects probe regions of interest in this image, then further 

1 0 analyzes these regions in all the focal plane images. 

After measuring cell features 112 (Figure 9), the system checks if there are any 
unprocessed objects in the current field 113- If there are any unprocessed objects, it 
locates the next object JJO and determines whether it meets the criteria for a valid cell 
nucleus 111, and measures its features. Once all the objects in the current field are 

15 processed, the system determines whether analysis of the current plate is complete 114; 
if not, it determines the need to find more cells in the current well 115. If the need 
exists, the system advances the XYZ stage to the next field within the current well 109 
or advances the stage to the next well li6 of the plate. 

After a plate scan is complete, images and data can be reviewed with the 

20 system's image review, data review, and summary review facilities. All images, data, 
and settings from a scan are archived in the system's database for later review or for 
interfacing with a network information management system. Data can also be exported 
to other third-party statistical packages to tabulate results and generate other reports. 
Users can review the images alone of every cell analyzed by the system with an 

25 interactive image review procedure il7- The user can review data on a cell-by-ceU 
basis using a combination of interactive graphs, a data spreadsheet of measured 
features, and images of all the fluorescence channels of a cell of interest with the 
interactive cell-by-cell data review procedure US. Graphical plotting capabilities are 
provided in which data can be analyzed via interactive graphs such as histograms and 

30 scatter plots. Users can review summary data that are accumulated and summarized for 
all cells within each well of a plate with an interactive well-by-well data review 
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procedure 112. Hard copies of graphs and images can be printed on a wide range of 
standard printers. 

As a final phase of a complete scan, reports can be generated on one or more 
statistics of the measured features. Users can generate a graphical report of data 
summarized on a well-by-weU basis for the scanned region of the plate using an 
interactive report generation procedure 120- This report includes a summary of the 
statistics by well in tabular and graphical format and identification information on the 
sample. The report window aUows the operator to enter comments about the scan for 
later retrieval. Multiple reports can be generated on many statistics and be printed with 
the touch of one button. Reports can be previewed for placement and data before being 
printed. 

The above-recited embodiment of the method operates in a single high 
resolution mode referred to as the high content screemng (HCS) mode. The HCS mode 
provides sufficient spatial resolution within a well (on the order of 1 \xm) to define the 
distribution of .material within the weU, as well as within individual cells in the well. 
The high degree of information content accessible in that mode, comes at the expense 
of speed and complexity of the required signal processing. 

In an alternative embodiment, a high throughput system (HTS) is directly 
cdupled with tiie HCS either on the same platform or on two separate platforms 
connected electronicaUy (e.g. via a local area network). This embodiment of the 
invention, referred to as a dual mode optical system, has the advantage of increasing the 
tiiroughput of an HCS by coupling it with an HTS and thereby requiring slower high 
resolution data acquisition and analysis only on the small subset of wells fliat show a 
response in the coupled HTS. 

High throughput 'whole plate' reader systems are well known in the art and are 
commonly used as a component of an HTS system used to screen large numbers of 
compounds (Beggs et al. (1997), supra; McCaffrey et al. (1996), supra ). The HTS of 
the present invention is carried out on the microtiter plate or microwell array by reading 
many or all wells in the plate simultaneously with sufficient resolution to make 
D determinatioi\s on a weU-by-well basis. That is, calculations are made by averaging the 
total signal output of many or all the cells or the bulk of the material in each well. 
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Wells that exhibit some defined response in the HTS (the 'hits') are flagged by the 
system. Then on the same microtiter plate or microwell array, each well identified as a 
hit is measured via HCS as described above. Thus, the dual mode process involves: 

1 . Rapidly measuring numerous wells of a microtiter plate or microwell array, 
5 2 Interpreting the data to determine the overall activity of fluorescently labeled 
reporter molecules in the cells on a well-by-well basis to identify "hits" (wells that 
e:dubit a defined response), 

3. Imaging numerous cells in each "bit" well, and 

4. Interpretmg the digital image data to determine the distribution, environment or 
10 activity of the fluorescently labeled reporter molecules in the individual cells (i.e. 

intracellular measurements) and the distribution of the cells to test for specific 
biological functions 

In a prefen-ed embodiment of dual mode processing (Figure 11), at the start of a 

15 run 30i, the operator enters information 302 that describes the plate and its contents, 
specifies the filter settings and fluorescent channels to match the biological labels being 
used, the inforaiation sought and the camera settings to match the sample brightness. 
These parameters are stored in the system's database for easy retrieval for each 
automated run. The microtiter plate or microwell array is loaded mto the cell screening 

20 ^ system 303 either manually or automatically by controlling a robotic loading device. 
An optional environmental chamber 304 is controlled by the system to maintain the 
temperattire, humidity and CO2 levels in the air surroundmg live cells m the microtiter 
plate or microwell array. An optional fluid delivery device 305 (see Figure 8) is 
controlled by the system to dispense fluids into the wells during the scan. 

25 High throughput processing is first perforaied on the microtiter plate or 

microwell array by acquiring and analyzing the signal fix>m each of the wells in the 
plate. The processing performed in high throughput mode 302 is illushrated in Figure 12 
and described below. Wells that exhibit some selected intensity response in this high 
throughput mode ("hits") are identified by the system. The system performs a 

30 conditional operation 3Q8 that tests for hits. If hits are found, those specific hit wells are 
fiirther analyzed m high content (micro level) mode 309. The processing performed in 
high content mode. 312 is illustrated in Figure 13. The system then updates 310 the 
informatics database 3il with results of the measurements on the plate. If thrae are 
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more plates to be analyzed 311 the system loads the next plate otherwise tiie 
analysis of the plates terminates 314 . 

The following discussion describes the high throughput mode illustrated in 
Figure 12. The preferred embodiment of the system, the single platform dual mode 
5 screening system, will be described. Those skilled in the art will recognize that 
operationally the dual platform system simply involves moving the plate between two 
optical systems rather than moving the optics. Once the system has been set up and the 
plate loaded, the system begins the HTS acquisition and analysis 401. The HTS optical 
module is selected by controUmg a motorized optical positioning device 4Q2 on the 
10 dual mode system. In one fluorescence channel, data from a primary marker on the 
plate is acquired 4fi3 and wells are isolated from the plate background using a masking 
procedure 404. Images are also acquired in other fluorescence channels being used 40i- 
The region in each image, corresponding to each well 406 is measured 402. A feature 
calculated from the measurements for a particular well is compared with a predefined 
1 5 threshold or intensity response 408. and based on the result the well is either flagged as 
a "hit" 40£ or not. The locations of the wells flagged as hits are recorded for 
subsequent high content mode processing. If there are wells remaming to be processed 
410 the program loops back 406 until all the wells have been processed 411 and the 
system exits high throughput mode. 
20 Following HTS analysis, the system starts the high content mode processing 

501 defined in Figure 13. The system selects flie HCS optical module 5fl2 by 
controlling the motorized positioning system. For each "hit" well identified in high 
throughput mode, the XY stage location of the well is retrieved from memory or disk 
and the stage is then moved to the selected stage location 5Qi. The autofocus procedure 
25 504 is called for the first field in each hit well and then once every 5 to 8 fields within 
each well. In one charaiel, images are acquired of the primary marker 505 (typically 
cell nuclei counterstained with DAPI, Hoechst or PI fluorescent dye). The images are 
then segmented (separated into regions of nuclei and non-nuclei) using an adaptive 
thresholding procedure 506 . The output of the segmentation procedure is a binary mask 
30 wherein the objects are white and the background is black. This binary image, also 
called a mask in the art, is used to determine if the field contains objects 507. The mask 
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is labeled with a blob labeling method whereby each object (or blob) has a unique 
number assigned to it. If objects are found in the field, images are acquired for all other 
active chanaiels 508. otherwise the stage is advanced to the next field 514 in the current 
well. Each object is located in the image for further analysis 509. Morphological 

5 features, such as area and shape of the objects, are used to select objects likely to be 
cell nuclei 510. and discard (do no further processing on) those that are considered 
artifacts. For each valid cell nucleus, the XYZ stage location is recorded, a small image 
of the cell is stored, and assay specific features are measured 511 . The system then 
performs multiple tests on the cells by applying several analytical methods to measure 

10 features at each of several wavelengths. After measuring the cell features, the systems 
checks if there are any unprocessed objects in the current field 512. If there are any 
unprocessed objects, it locates the next object 509 and determines whether it meets the 
criteria for a valid cell nucleus 510. and measures its features. After processing all the 
objects in the current field, the system deteremines whether it needs to find more cells 

15 or fields in the current well 511. If it needs to find more cells or fields in the current 
well it advances the XYZ stage to the next field within the current well 515. 
Otherwise, the system checks whether it has any remauiing hit wells to measure 51^. If 
so, it advances to the next hit well 503 and proceeds through another cycle of 
acquisition and analysis, otherwise the HCS mode is finished 516 . 

20 In an alternative embodiment of the present invention, a method of kinetic live 

cell screening is provided. The previously described embodiments of the invention are 
used to characterize the spatial distribution of cellular components at a specific point in 
time, the tune of chemical fixation. As such, these embodiments have, limited utility 
for implementing kinetic based screens, due to the sequential nature of the image 

25 acquisition, and the amount of time required to read all the wells on a plate. For 
example, suice a plate can require 30 - 60 minutes to read through all the wells, only 
very slow kinetic processes can be measured by simply preparing a plate of live cells 
and then reading through all the wells more than once. Faster kinetic processes can be 
measured by taking multiple readings of each weU before proceeding to the next well, 

30 but the elapsed time between the first and last well would be too long, and fast kinetic 
processes would likely be complete before reaching the last well. 
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The kinetic live cell extension of the invention enables the design and use of 
screens in which a biological process is characterized by its kinetics instead of, or in 
addition to, its spatial characteristics. In many cases, a response in Uve cells can be 
measured by adding a reagent to a specific well and making multiple measurements on 
5 that well with the appropriate timing. This dynamic Uve cell embodiment of the 
invention therefore includes apparatus for fluid delivery to mdividual wells of the 
system in order to deliver reagents to each weU at a specific time in advance of reading 
the well. This embodiment thereby allows kinetic measurements to be made with 
temporal resolution of seconds to minutes on each well of the plate. To improve the 
10 overall efficiency of the dynamic Uve cell system, the acquisition control program is 
modified to allow repetitive data collection from sub-regions of the plate, allowmg the 
system to read other wells between the time points required for an individual well. 

Figure 8 describes an example of a fluid delivery device for use with the live 
ceU embodiment of the invention and is described above. This set-up allows one set of 
15 pipette tips 7Q5, or even a single pipette tip, to deliver reagent to all the wells on the 
plate. The bank of syringe pumps 201 can be used to deliver fluid to 12 wells 
simultaneously, or to fewer wells by removing some of the tips 705. The temporal 
resolution of the ^stem can therefore be adjusted, without sacrificing data collection 
efficiency, by changing the number of tips and the scan pattern as foUows. Typically, 
20 the data collection and analysis from a smgle well takes about 5 seconds. Moving firom 
well to well and focusing in a weU requires about 5 seconds, so the overall cycle time 
for a weU is about 10 seconds. Therefore, if a single pipette tip is used to deliver fluid 
to a single well, and data is collected repetitively &om that well, measurements can be 
made with about 5 seconds temporal resolution. If 6 pipette tips are used to deliver 
25 fluids to 6 wells simultaneously, and the system repetitively scans aU 6 wells, each scan 
will requke 60 seconds, thereby estabUshing the temporal resolution. For slower 
processes which only require data collection every 8 minutes, fluids can be delivered to 
one half of the plate, by moving the plate during the fluid deUvery phase, and then 
repetitively scanning that half of the plate. Therefore, by adjusting the size of the sub- 
30 region being scanned on the plate, the temporal resolution can be adjusted without 
having to insert wait times between acquisitions. Because the system is continuously 

scanning and acquiring data, the overall time to collect a kinetic data set from the plate 
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is then simply the time to perform a single scan of the plate, multiplied by the mmiber 
of time points required. Typically, 1 time point before addition of compounds and 2 or 
3 time points following addition should be sufBcient for screening purposes. 

Figure 14 shows the acquisition sequence used for kinetic analysis. The start of 
processing 8Q1 is configuration of the system, much of which is identical to the 
standard HCS configuxadon. In addition, the operator must enter information specific 
to the kinetic analysis being performed 802, such as the sub-region size, the number of 
time points required, and the required time increment. A sub-region is a groiqj of wells 
that will be scanned repetitively in order to accumulate kinetic data. The size of the 
sub-region is adjusted so that the system can scan a whole sub-region once during a 
single time increment, thus minimizing wait times. The optimum sub-region size is 
calculated firom the setup parameters, and adjusted if necessary by the operator. The 
system then moves the plate to the first sub-region 803, and to the first well in that sub- 
region 804 to acquire the prestimulation (time = 0) time points. The acquisition 
sequence performed in each well is exactly the same as that required for the specific 
HCS being run in kinetic mode. Figure 15 details a flow chart for diat processing. All 
of the steps between the start 901 and the return 902 are identical to those described as 
steps 504 - 514 in Figure 13. 

After processing each well in a sub-region, the system checks to see if all the 
wells in the sub-region have been processed 80^ (Figure 14), and cycles through all the 
wells until the whole region has been processed. The system then moves the plate into 
position for fluid addition, and controls fluidic system delivery of fluids to the entire 
sub-region 807. This may require multiple additions for sub-regions which span 
several rows on the plate, with the system moving the plate on the X,Y stage between 
additions. Once the fluids have been added, the system moves to the first well in the 
sub-region 808 to begin acquisition of time points. The data is acquired fix>m each well 
809 and as before the system cycles through all the wells m the sub-region %10. After 
each pass through flie sub-region, the system checks whether all the time points have 
been collected £11 and if not, pauses 811 if necessary 812, to stay synchronized with the 
requested time increment. Otherwise, the system checks for additional sub-regions on 
the plate 814 and either moves to the next sub-region 8fi3 or finishes 815. Thus, the 
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kinetic analysis mode comprises operator identification of sub-regions of the microtiter 
plate or microwells to be screened, based on the kinetic response to be investigated, 
with data acquisitions within a sub-region prior to data acquisition in subsequent sub- 
regions. 

5 Specific Screens 

In another aspect of the present invention, cell screening methods and machine 
readable storage medium comprising a program containing a set of instmctions for 
causing a cell screening system to execute procedures for defining the distribution and 
activity of specific cellular constituents and processes is provided. In a preferred 

10 embodiment, the cell screening system comprises a high magnification fluorescence 
optical system with a stage adapted for holding cells and a means for movmg the stage, 
a digital camera, a light source for receiving and processing the digital data from the 
digital camera, and a computer means for receiving and processing the digital data fi-om 
the digital camera.' This aspect of the invention comprises programs that instruct the 

15 cell screening system to define the distribution and activity of specific cellular 
constituents and processes, using the luminescent probes, the optical imaging system, 
and the pattem recognition software of the invention. Preferred embodiments of the 
machine readable storage medium comprise programs consisting of a set of instructions 
for causing a cell screening system to execute the procedures set forth in Figiures 9, 11, 

20 12, 13, 14 or 15. Another preferred embodiment comprises a program consisting of a 
set of instructions for causing a cell screening system to execute procedures for 
detecting the distribution and activity of specific cellular constituents and processes. In 
most preferred embodiments, the cellular processes include, but are not limited to, 
nuclear translocation of a protein, cellular morphology, apoptosis, receptor 

25 internalization, and protease-induced translocation of a, protein. 

In a preferred embodiment, the cell screening methods are used to identify 
compounds that modify the various cellular processes. The cells can be contacted with 
a test compound, and the effect of the test compound on a particular cellular process 
can be analyzed. Alternatively, the cells can be contacted with a test compound and a 
30 known agent that modifies the particular cellidar process, to determine whether the test 
compound can inhibit or enhance the effect of the known agent. Thus, the methods can 
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be used to identify test compoxmds that increase or decrease a particular cellular 
response, as well as to identify test compounds that affects the ability of other agents to 
increase or decrease a particular cellular response. 

In another preferred embodiment, the locations containing cells are analyzed 
5 using the above methods at low resolution in a high throughput mode, and only a subset 
of the locations contaming cells are analyzed in a high content mode to obtain 
luminescent signals from the luminescently labeled reporter molecules in subcellular 
compartments of the cells being analyzed. 

The following examples are intended for purposes of illustration only and 
10 should not be construed to limit the scope of the mvention, as defined m the claims 
appended hereto. 

The various chemical compounds, reagents, dyes, and antibodies that are 
referred to in the following Examples are commercially available from such sources as 
Sigma Chemical (St Louis, MO), Molecular Probes (Eugene, OR), Aldrich Chemical 
15 Company (Milwaukee, WI), Accurate Chemical Company (Westbury, NY), Jackson 
hnmunolabs, and Clontech (Palo Alto, CA), 

Example 1 Cytoplasm to I^ucleus Translocation Screening: 

a. Transcription Factors 

Regulation of transcription of some genes involves activation of a transcription 
factor in the cytoplasm, resulting in that factor being transported into the nucleus where 
it can initiate transcription of a particular gene or genes. This change in transcription 
factor distribution is the basis of a screen for the cell-based screening system to detect 
compounds that inhibit or induce transcription of a particular gene or group of genes. 
A general description of the screen is given followed by a specific example. 

The distribution of the transcription factor is determined by labeling the nuclei 
with a DNA specific fluorophore like Hoechst 33423 and the transcription factor with a 
specific fluorescent antibody. After autofocusmg on the Hoechst labeled nuclei, an 
image of the nuclei is acquired in the cell-based screening system and used to create a 
mask by one of several optional thresholding methods, as described supra. The 
morphological descriptors of the regions defined by the mask are compared with the 
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user defined parameters and valid nuclear masks are identified and used with the 
following method to extract transcription factor distributions. Each valid nuclear mask 
is eroded to define a slightly smaller nuclear region. The original nuclear mask is then 
dilated in two steps to define a ring shaped region around the nucleus, which represents 

5 a cytoplasmic region. The average antibody fluorescence in each of these two regions 
is determined, and the difference between these averages is defined as the NucCyt 
Difference. Two examples of determining nuclear translocation are discussed below 
and illustrated in Figure lOA-J. Figure lOA illustrates an unstunulated cell with its 
nucleus 200 labeled with a blue fluorophore and a transcription factor in the cytoplasm 

10 201 labeled with a green fluorophore. Figure lOB illustrates the nuclear mask 2Q2 
derived by the cell-based screening system. Figure IOC illustrates the cytoplasm 203 
of the unstimulated cell imaged at a green wavelength. Figure lOD illustrates the 
nuclear mask 202 is eroded (reduced) once to define a nuclear samplmg region 204 
with minimal cytoplasmic distribution. The nucleus boundary 202 is dilated (expanded) 

15 several times to form a ring that is 2-3 pixels wide fliat is used to define the 
cytoplasmic sampling region 205 for the same cell. Figure lOE further illustrates a side 
view which shows the nuclear sampling region 204 and the cytoplasmic sampling 
region 2Q1. Using these two sampling regions, data on nUclear translocation can be 
automatically analyzed by the cell-based screening system on a cell by cell basis. 

20 Figure lOF-J illustrates the strategy for determining nuclear translocation in a 
stimulated cell. Figure lOF illustrates a stimulated cell with its nucleus 206 labeled with 
a blue fluorophore and a transcription factor in the cytoplasm 222 labeled with a green 
fluorophore. The nuclear mask 208 in Figure lOG is derived by the ceU based 
screening system. Figure lOH illustrates the cytoplasm 202 of a stimulated cell imaged 

25 at a green wavelength. Figure 101 illustrates the nuclear sampling region 211 and 
cytoplasmic sampling region 211 of the stimulated cell. Figure lOJ fiirther illustrates a 
side view which shows the nuclear sampUng region 211 and the cytoplasmic sampling 
region 212 . 

A specific application of this method has been used to validate this method as a 
30 screen. A human cell line was plated in 96 well microtiter plates. Some rows of wells 
were titrated with lL-1, a known inducer of the NF-KB transcription factor. The cells 
were then fixed and stained by standard methods with a fluorescein labeled antibody to 
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the transcription factor, and Hoechst 33423. The cell-based screening system was used 
to acquire and analyze images from this plate and tiie NucCyt Difference was found to 
be strongly correlated with the amount of agonist added to the wells as illustrated in 
Figure 16. In a second experiment, an antagonist to the receptor for IL-1, IL-IRA was 

5 titrated in the presence of IL-1 a, progressively inhibiting the translocation induced by 
IL-1 a. The NucCyt Difference was found to strongly correlate with this inhibition of 
translocation, as illustrated in Figure 17. 

Additional experiments have shown that the NucCyt Difference, as well as the 
NucCyt ratio, gives consistent results over a wide range of cell densities and reagent 

10 concentrations, and can therefore be routinely used to screen compound libraries for 
specific nuclear translocation activity. Furthermore, the same method can be used with 
antibodies to other transcription factors, or GFP-transcription factor chimeras, or 
fluorescently labeled transcription factors introduced into living or fixed cells, to screen 
for effects on the regulation of transcription factor activity. 

15 Figure 18 is a representative display on a PC screen of data which was obtained 

in accordance with Example I. Graph 1 180 plots the difference between the average 
antibody fluorescence in the nuclear sampling region and cytoplasmic sampling region, 
NucCyt Difference verses Well #. Graph^Z 181 plots the average fluorescence of the 
antibody in the nuclear sampling region, NPl average, versus the Well #. Graph 3 182 

20 plots the average antibody fluorescence in the cytoplasmic sampling region, LIPl 
average, versus Well #. The software permits displaying data fi:om each cell. For 
example, Figure 18 shows a screen display 183. the nuclear image 184, and the 
fluorescent antibody image 185 for cell #26. 

NucCyt Difference referred to in graph 1 18Q of Figure 18 is the difference 

25 between the average cytoplasmic probe (fluorescent reporter molecule) intensity and 
the average nuclear probe (fluorescent reporter molecule) intensity. NPl average 
referred to in graph 2 181 of Figure 18 is the average of cytoplasmic probe (fluorescent 
reporter molecule) intensity within the nuclear sampling region. LlPl average referred 
to in graph 3 182 of Figure 18 is the average probe (fluorescent reporter molecule) 

30 intensity within the cytoplasmic sampling region. 

It will be understood by one of skill m the art that this aspect of the invention 
can be performed using other transcription factors that translocate from the cytoplasm 
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to the nucleus upon activation. In another specific example, activation of the c-fos 
transcription factor was assessed by defining its spatial position within cells. Activated 
c-fos is found only within the nucleus, while inactivated c-fos resides within the 
cytoplasm. 

5 3T3 cells were plated at 5000-10000 cells per well in a Polyfiltronics 96-weU 

plate. The cells were allowed to attach and grow overnight The cells were rinsed 
twice with 100 \il serum-free medium, incubated for 24-30 hours in serum-firee MEM 
culture medium, and then stimulated with platelet derived growth factor (PDGF-BB) 
(Sigma Chemical- Co., St Louis, MO) diluted directly into seram . free medium at 

10 concentrations ranging from 1-50 ng/ml for an average time of 20 minutes. 

Following stimulation, cells were fixed for 20 minutes in 3.7% formaldehyde 
solution in IX Hanks buffered saline solution (HBSS). After fixation, the cells were 
washed with HBSS to remove residual fixative, permeabilized for 90 seconds witii 
0.5% Triton X-100 solution in HBSS, and. washed twice witiiHBSS to remove residual 

15 detergent. The cells were then blocked for 1 5 minutes vwth a 0.1% solution of BSA in 
HBSS, and fiirthei" washed with HBSS prior to addition of diluted primary antibody 
solution. 

c-Fos rabbit polyclonal aitibody (Calbiochem, PC05) was diluted 1:50 in 
HBSS, and 50 ^1 of the dilution was appHed to each well. Cells were incubated m the 

20 presence of primary antibody for one hour at'room temperature, and tiien incubated for 
one hour at room temperature in a light tight container witii goat anti-rabbit secondary 
antibody conjugated to ALEXA™ 488 (Molecular Probes), diluted 1:500 firom a 100 
^g/ml stock in HBSS. Hoechst DNA dye (Molecular Probes) was tiien added at a 
1:1000 dilution of the manufectiirer's stock solution (10 mg/ml). The cells were then 

25 washed with HBSS, and the plate was sealed prior to analysis with the cell screening 
system of the invention. The data from these experiments demonstrated tiiat the 
metiiods of tiie invention could be used to measure transcriptional activation of c-fos by 
defining its spatial position within cells. 

One of skill in the art will recognize that while the following method is applied to 

30 detection of c-fos activation, it can be appUed to the analysis of any transcription factor 
that translocates from the cytoplasm to the nucleus upon activation. Examples of such 
transcription factors include, but are not limited to fos and jun homologs, NF-KB 
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(nuclear factor kappa from B cells), NFAT (nuclear factor of activated T-lymphocytes), 
and STATs (signal transducer and activator of transcription) factors (For exanaple, see 
Strehlow, I., and Schindler, C. 1998. J, Biol Chem. 273:28049-28056; Chow, et al. 
1997 Science. 278:1638-1641; Ding et al. 1998 J, Biol. Chem. 273:28897-28905; 

5 Baldwin, 1996. Annu Rev Immunol. 14:649-83; Kuo, C.T., and LM. Leiden. 1999. 
Annu Rev Immunol. 17:149-87; Rao, et al. 1997. Annu Rev Immunol. 15:707-47; 
Masuda,et al. 1998. Ce// 10:599-611; Hoey, T., and U. Schindler. 1998. Curr 
Opin Genet Dev. 8:582-7; Liu, et al. 1998. Curr Opin Immunol 10:271-8.) 

Thus, in this aspect of the invention, indicator cells are treated with test 

10 compounds and the distribution of luminescently labeled transcription factor is 
measured in space and time using a cell screening system, such as the one disclosed 
above. The luminescently labeled transcription factor may be expressed by or added to 
the cells either before, together with, or after contacting the cells with a test compound. 
For example, the transcription factor may be exTJressed as a luminescently 

15 labeled protein chimera by transfected indicator cells. Alternatively, the lununescently 
labeled transcription factor may be expressed, isolated, and bulk-loaded into the 
indicator cells as described above, or the transcription factor may be luminescently 
labeled after isolation. As a fiirther alternative, the transcription factor is expressed by 
the indicator cell, which is subsequently contacted with a luminescent labels such as an 

20 antibody, that detects the transcription factor. 

In a ftirther aspect, kits are provided for analyzing transcription factor activation, 
comprising an antibody that specifically recognizes a transcription factor of interest, 
and instructions for using the antibody for carrying out the methods described above. 
In a preferred embodiment, the transcription factor-specific antibody, or a secondary 

25 antibody that detects the transcription factor antibody, is luminescently labeled. In 
fiirther preferred embodiments, the kit contains cells that express the transcription 
factor of interest, and/or the kit contains a compound that is known to modify activation 
of the transcription factor of interest, including but not limited to platelet derived 
growth factor (PDGF) and serum, which both modify fos activation; and hiterleukm 

30 1(IL-1) and tumor necrosis factor (TNF), which both modify NF-KB activation. 

In another embodiment, the kit comprises a recombinant expression vector 
comprising a nucleic acid encoding a transcription factor of interest that translocates 
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from the cytoplasm to the nucleus upon activation, and instructions for using the 
expression vector to identify compounds that modify transcription factor activation in a 
cell of interest. Alternatively, the Icits contain a purified, luminescently labeled 
transcription factor. In a preferred embodiment, the transcription factor is expressed as 

5 a fusion protein with a luminescent protein, including but not limited to green 
fluorescent protein, luceriferase, or mutants or fragments thereof. In various preferred 
embodiments, the kit further contains cells that are transfected with the expression 
vector, an antibody or fragment that specifically bind to the transcription factor of 
interest, and/or a compound that is known to modify activation of the transcription. 

10 factorof interest (as above). 

b. Protein Kinases 

The cytoplasm to nucleus screening methods can also be used to analyze the 
activation of any protein kinase that is present in an inactive state in the cytoplasm and 

15" is transported to the nucleus upon activation, or that phosphorylates a substrate that 
translocates from the cytoplasm to the nucleus upon phosphorylation. Examples of 
appropriate protein kinases include, but are not limited to extraceUular signal-regulated 
protein kinases (ERKs), c-Jun amino-teraiinal kinases (JNKs), Fos regulating protein 
kinases (FRKs), p38 mitogen activated protem kinase (pSSli^APK), protein kinase^A 

20 (PKA), and mitogen activated protein kinase kinases (MAPKKs). (For example, see 
Hall, et al. 1999. J Biol Chem. 274:376-83; Han, et al. 1995. Biochim. Biophys. Acta. 
1265:224-227; Jaaro et al. 1997. Proc. Natl. Acad. Sci. U.S.A. 94:3742-3747; Taylor, et 
al. 1994. J. Biol. Chem. 269:308-318; Zlaao, Q., and F. S. Lee. 1999. J Biol Chem. 
274:8355-8; PaoUlloet al. 1999. J Biol Chem. 274:6546-52; Coso et al. 1995. Cell 

25 81 :1137-1146; Tibbies, L.A., and J.R. Woodgett. 1999. Cell Mol Life Sci. 55: 1230-54; 
Schaeffer, H.J., andM.J. Weber. 1999. Mol Cell Biol. 19:2435-44.) 

Alternatively, protein kinase activity is assayed by monitoring translocation of a 
luminescently labeled protein kinase substrate from the cytoplasm to the nucleus after 
being phosphorylated by the protein kinase of interest. In this embodiment, the 

30 substrate is non-phosphorylated and cytoplasmic prior to phosphorylation, and is 
translocated to the nucleus upon phosphorylation by the protein kinase. There is no 
requirement that the protein kinase itself translocates from the cytoplasm to flie nucleus 
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in this embodiment. Examples of such substrates (and the corresponding protein 
Idnase) include, but are not limited to c-jun (JNK substrate); fos (FRK substrate), and 
p38 (p38MAPK substrate). 

Thus, in these embodiments, indicator cells are treated with test compoimds and 

5 the distribution of luminescently labeled protein kinase or protein kinase substrate is 
measured in space and time using a cell screening system, such as the one disclosed 
above. The luminescently labeled protein kinase or protein kinase substrate may be 
expressed by or added to the cells either before, together with, or after contactmg the 
cells with a test compound. For example, the protein kinase or protein kinase substrate 

10 may be expressed as a luminescently labeled protein chimera by transfected indicator 
cells. Alternatively, the luminescently labeled protein kinase or protein kinase 
substrate may be expressed, isolated, and bulk-loaded into the indicator cells as 
described above, or the protein kinase or protein kinase substrate may be luminescently 
labeled after isolation. As a fiirther alternative, the protein kinase or protein kinase 

15 substrate is expressed by the indicator cell, which is subsequently contacted with a 
luminescent label, such as a labeled antibody, that detects the protein kinase or protein 
kinase substrate. 

In a further embodiment, protein kinase activity is assayed by monitoring the 
phosphorylation state (ie: phosphorylated or not phosphorylated) of a protein kinase 

20 substrate. In this embodiment, there is no requirement that either the protein kinase or 
the protein kinase substrate translocate from the cytoplasm to the nucleus upon 
activation. In a preferred embodiment, phosphorylation state is monitored by 
contacting the cells with an antibody that binds only to the phosphorylated form of the 
protein kmase substrate of interest (For example, as disclosed in U.S. Patent No. 

25 5,599,681). 

In another preferred embodiment, a biosensor of phosphorylation is used. For 
example, a luminescently labeled protein or fragment thereof can be fiised to a protein 
that has been engineered to contain (a) a phosphorylation site that is recognized by a 
protein kinase of interest; and (b) a nuclear localization signal that is unmasked by the 
30 phosphorylation. Such a biosensor will thus be translocated to the nucleus upon 
phosphorylation, and its translocation can be used as a measure of protein kinase 
activation. 
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In another aspect, kits are provided for analj^ing protein kinase activation, 
comprising a primary antibody that specifically binds to a protein kinase, a protein 
kinase substrate, or a phosphorylated form of the protein kinase substrate of interest and 
instructions for using the primary antibody to identify compounds that modify protein 
5 kinase activation in a cell of interest. In a preferred embodiment, the primary antibody, 
or a secondary antibody that detects the primary antibody, is luminescently labeled. In 
other preferred embodiments, the kit further comprises cells that express the protein 
kinase of interest, and/or a compound that is known to modify activation of the protein 
kinase of interest, including but not limited to dibutyryl cAMP (modifies PKA), 

10 forskolin (PKA), and anisomycin (p38MAPK). 

Alternatively, the kits comprise an expression vector encoding a protein kinase 
or a protein kinase substrate of interest that translocates fi"om the cytoplasm to the 
nucleus upon activation and instructions for using the expression vector to identify 
compounds that modify protein kinase activation in a cell of interest. Alternatively, the 

15 kits contain a purified, Imninescently labeled protein kinase or protein kinase substrate. 
In a preferred embodiment, the protein kinase or protein kinase substrate of interest is 
expressed as a fiision protein with a luminescent protein. In further preferred 
embodiments, the kit further comprises cells that are transfected with the expression 
vector, an antibody or firagment thereof that specifically binds to the protein kinase or 

20 protein kinase substrate of interest, and/or a compound that is known to modify 
activation of the protein kinase of interest, (as above) 

In another aspect, the present invention comprises a machine readable storage 
medium comprising a program containing a set of instructions for causing a cell 
screening system to execute the methods disclosed for analyzing transcription factor or 

25 protein kinase activation, wherein the cell screening system comprises an optical 
system with a stage adapted for holding a plate containing cells, a digital camera, a 
means for directing fluorescence or luminescence emitted from the cells to the digital 
camera, and a computer means for receiving and processing the digital data fi*om the 
digital camera. 

30 
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Example 2 Automated Screen for Compounds that Modify Cellular Morphology 

Changes in cell size are associated with a number of cellular conditions, such as 
hypertrophy, cell attachment and spreading, differentiation, growth and division, 
necrotic and progranraied cell death, cell motility, morphogenesis, tube formation, and 
colony formation. 

For example, cellular hypertrophy has been associated with a cascade of 
alterations in gene expression and can be characterized in cell culture by an alteration in 
cell size, that is clearly visible in adherent cells growing on a coverslip, 

Cell size can also be measured to deteraiine the attachment and spreading of 
adherent cells. Cell spreading is the result of selective binding of ceU surface receptors 
to substrate ligands and subsequent activation of signaling pathways to the 
cytoskeleton. Cell attachment and spreading to substrate molecules is an important step 
for the metastasis of cancer cells, leukocyte activation during the inflammatory 
response, keratinocyte movement during wound healing, and endothelial cell 
movement during angiogenesis. Compounds that affect these surface receptors, 
signaling pathways, or the cytoskeleton will affect cell spreading and can be screened 
by measuring cell size. 

Total cellular area can be monitored by labeling the entire cell body or the cell 
cytoplasm using cytoskeletal markers, cytosolic volume markers, or cell surface 
markers, in conjunction with a DNA label. Examples of such labels (many available 
from Molecular Probes (Eugene, Oregon) and Sigma Chemical Co. (St. Louis, 
Missoiui)) include the foUowong: 
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CELL SIZE AND AREA MARKERS 

Cytoslceletal Markers . 

• ALEXA™ 488 phalloidin (Molecular Probes. Oregon) 

• Tubulin-green fluorescent protein chimeras , _ 

• Cytokeratin-green fluorescent protein chimeras , - — - — 

• Antibodies to cvtoskeletal proteins . 

CytosoHc Volume Markers . , 

> Green fluorescent proteins 

• Chloromethvlfluorescein diacetate fCMFDA) , — : — „ 

• Calcein green ^ 

• BCECF/AM ester 

• Rhodamine dextraii — 

Cell Surface Markers for Lipid, Protein, or Oligosaccharide ^ 

• Dihexadecvl tetramethvlindocarbocvanine perchl orate (DilCl6) lipid dyes , _ — _ — 

• Triethylammonium propyl dibutvlamino stvrv l pyridinium (FU 4-64. FM 1-43) lipid dyes 

• MTTOTRAGKER'^*^ Green FM — : 

• Lectins to ohgosaccarides such as fluorescein concan avalin A or wheat germ agglutinm 

• SYPRO'^'^ Red non-specific protein markers , — — 

• Antibodies to yarious surface proteins such as ep idermal growth factor , 

_ » Biotin labeling of surface proteins followed by fluorescent stre oayidin labelemg 

Protocols for cell staining with these various agents are well known to those 
skilled in the art. Cells are stained live or after fixation and the cell area can be 

5 measured. For example, Uve cells stained with DiIC16 have homogeneously labeled 
plasma membranes, and the projected cross-sectional area of the cell is unifoimly 
discriminated from background by fluorescence intensity of the dye. Live cells stained 
with cytosolic stains such as CMFDA produce a fluorescence intensity that is 
proportional to cell thickness. Although cell labeling is dimmer in thin regions of the 

10 cell, total cell area can be discriminated from background. Fixed cells can be stained 
with cytoskeletal markers such as ALEXA™ 488 phalloidin that label polymerized 
actin. Phalloidin does not homogeneously stain the cytoplasm, but still permits 
discrimination of the total cell area from background. 

15 Cellular hypertrophy 

A screen to analyze cellular hypertrophy is implemented using the following 
strategy. Primary rat myocytes can be cultured in 96 well plates, treated with various 
compounds and then fixed and labeled with a fluorescent marker for the cell membrane 
or cytoplasm, or cytoskeleton, such as an antibody to a cell surface marker or a 
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fluorescent marker for the cytoskeleton like rhodamine-phalloidin, in combination with 
a DNA label like Hoechst. 

After focusing on the Hoechst labeled nuclei, two images are acquired, one of 
the Hoechst labeled nuclei and one of the fluorescent cytoplasm image. The nuclei are 

5 identified by thresholding to create a mask and then comparing the morphological 
descriptors of the mask with a set of user defined descriptor values. Each non-nucleus 
image (or "cytoplasmic image") is then processed separately. The original cytoplasm 
image can be thresholded, creating a cytoplasmic mask image. Local regions containing 
cells are defined around the nuclei. The limits of the cells in those regions are then 

10 defined by a local dynamic threshold operation on the same region in the fluorescent 
antibody image. A sequence of erosions and dilations is used to separate slightly 
touching cells and a second set of morphological descriptors is used to identify single 
cells. The area of the individual cells is tabulated in order to define the distribution of 
cell sizes for comparison with size data from normal and hypertrophic cells. 

15 Responses from entire 96-well plates (measured as average cytoplasmic 

area/cell) were analyzed by the above methods, and the results demonstrated that the 
assay will perform the same on a well-to-well, plate-to-plate, and day-to-day basis 
(below a 15% cov for maximum signal). The data showed very good correlation for^ 
each day, and that there was no variability due to well position in tKe plate, 

20 The following totals can be computed for the field. The aggregate whole 

nucleus area is the number of nonzero pixels in the nuclear mask. The average whole 
nucleus area is the aggregate whole nucleus area divided by the total nmnber of nuclei. 
For each cj^oplasm" image several values can be computed. These are the total 
cytoplasmic area, which is the count of nonzero pixels in the cytoplasmic mask. The 

25 aggregate cytoplasm intensity is the sum of the intensities of all pixels in the 
cytoplasmic mask. The cytoplasmic area per nucleus is the total cytoplasmic area 
divided by the total nucleus count. The cytoplasmic intensity per nucleus is the 
aggregate cytoplasm intensity divided by the total nucleus count. The average 
cytoplasm intensity is the aggregate cytoplasm intensity divided by the cytoplasm area. 

30 The cytoplasm nucleus ratio is the total cytoplasm area divided by the total nucleus 
area. { > 
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Additionally, one or more fluorescent antibodies to other cellular proteins, such 
as the major muscle proteins actin or myosin, can be included. Images of these 
additional labeled proteins can be acquired and stored with the above images, for later 
review, to identify anomalies in the distribution and morphology of these proteins in 
5 hypertrophic cells. This example of a multi-parametric screen allows for simultaneous 
analysis of cellular hypertrophy and changes in actin or myosin distribution. 

One of skill in the art will recognize that while the example analyzes myocyte 
hypertrophy, the methods can be applied to analyzing hypertrophy, or general 
morphological changes in any cell type. 

10 . ' 

Cell morphology assays for prostate carcinoma 

Cell spreading is a measure of the response of cell surface receptors to substrate 
attachment ligands. Spreading is proportional to the ligand concentration or to the 
concentration of compounds that reduce receptor-ligand function. One example of 

15 selective cell-substrate attachment is prostate carcinoma cell adhesion to the 
extracellular matrix protein collagen. Prostate carcinoma cells metastasize to bone via 
selective adhesion to collagen. 

Compounds that interfere with metastasis of prostate carcinoma cells were 
screened as foUows. PC3 human prostate carcinoma cells were cultured/in media with 

20 appropriate stimulants and are passaged to collagen coated 96 well plates. Ligand 
concentration can be varied or inhibitors of cell spreading can be added to the wells. 
Examples of compounds that can affect spreading are receptor antagonists such as 
integrin- or proteoglycan-blocking antibodies, signaling inhibitors including 
phosphatidyl inositol-3 kinase inhibitors, and cytoskeletal inhibitors such as 

25 cytochalasin D. After two hours, cells were fixed and stained with ALEXA™ 488 
phalloidin (Molecular Probes) and Hoechst 33342 as per the protocol for cellular 
hypertrophy. The size of cells under these various conditions, as measured by 
cytoplasmic staining, can be distinguished above background levels. The number of 
cells per field is detennined by measuring the number of nuclei stained with the 

30 Hoechst DNA dye. The area per cell is found by dividing the cytoplasmic area 
(phalloidin image) by the cell number (Hoechst image). The size of cells is 
proportional to the ligand-receptor function. Since the area is detemiined by ligand 
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concentration and by the resultant function of the cell, drug efficacy, as well as drug 
potency, can be determined by this cell-based assay. Other measurements can be made 
as discussed above for cellular hypertrophy. 

The methods for analyzing cellular morphology can be used in a combined high 

5 throughput-high content screen. In one example, the high throughput mode scans the 
whole well for an increase in fluorescent phalloidin intensity. A threshold is set above 
which both nuclei (Hoechst) and cells (phalloidin) are measured in a high content 
mode. In another example, an environmental biosensor (examples include, but are not 
limited to, those biosensors that are sensitive to calcium and pH changes) is added to 

10 the cells, and the cells are contacted with a compound. The cells are scanned in a high 
throughput mode, and those wells that exceed a pre-determined threshold for 
luminescence of the biosensor are scanned in a high content mode. 

In a further aspect, kits are provided for analyzing cellular morphology, 
comprising a luminescent compound that can be used to specifically label the cell 

15 cytoplasm, membrane, or cytoskeleton (such as those described above), and 
instructions for using the luminescent compound to identify test stimuli that induce or 
inhibit changes in cellular morphology according to the above methods. In a preferred 
embodiment, the kit further comprises a luminescent marker for cell nuclei. In a further 
preferred embodiment, the kit comprises at least one compound that is known to 

20 modify cellular morphology, including, but not limited to integrin- or proteoglycan- 
blocking antibodies, signaluig inhibitors includuig phosphatidyl inositol-3 kuiase 
inhibitors, and cytoskeletal inhibitors such as cytochalasin D. 

In another aspect, the present invention comprises a machine readable storage 
medium comprising a program containing a set of instructions for causing a cell 

25 screening system to execute the disclosed methods for analyzing cellular niorphology, 
wherein the cell screening system comprises an optical system with a stage adapted for 
holding a plate containmg cells, a digital camera, a means for directing fluorescence or 
luminescence emitted from the cells to the digital camera, and a computer means for 
receiving and processing the digital data from the digital camera. . 

30 Example 3 Dual Mode High Throughput and High-Content Screen 

The following example is a screen for activation of a G-protein coupled receptor 
(GPCR) as detected by the translocation of the GPCR from the plasma membrane to a 
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proximal nuclear location. This example illustrates how a high throughput screen can 
be coupled with a high-content screen in the dual mode System for Cell Based 
Screening. 

G-protem coupled receptors are a large class of 7 trans-membrane domain cell 

5 surface receptors. Ligands for these receptors stimulate a cascade of secondary signals 
in the cell, which may include, but are not limited to, Ca^ transients, cyclic AMP 
production, inositol triphosphate (IP3) production and phosphorylation. Each of these 
signals are rapid, occuring in a matter of seconds to minutes, but are also generic. For 
example, many different GPCRs produce a secondary Ca"^ signal when activated. 

1 0 Stimulation of a GPCR also results in the transport of that GPCR from the cell surface 
membrane to an internal, proximal nuclear compartment. This internalization is a much 
more receptor-specific indicator of activation of a particular receptor than are the 
secondary signals described above. 

Figure 19 illustrates a dual mode screen for activation of a GPCR. Cells 

1 5 carrying a stable chimera of the GPCR with a blue fluorescent protein (BFP) would be 
loaded with the acetoxymethylester form of Fluo-3, a cell permeable calcium indicator 
(green fluorescence) that is trapped in Uving cells by the hydrolysis of the esters. They 
would then be deposited into the wells of a microtiter plate 601. The wells would then 
be treated with an array of test compounds using a fluid delivery system, and a short 

20 sequence of Fluo-3 images of the whole microtiter plate would be acquired and 
analyzed for wells exhibiting a calcium response (i.e., high throughput mode). The 
images would appear like the illustration of the microtiter plate §Ql in Figure 19. A 
small number of wells, such as wells C4 and E9 in the illustration, would fluoresce 
more brightly due to the Ca** released upon stimulation of the receptors. The locations 

25 of wells containing compounds that induced a response 6Q2, would then be transferred 
to the HCS program and the optics switched for detailed cell by cell analysis of the blue 
fluorescence for evidence of GPCR translocation to the perinuclear region. The bottom 
of Figure 19 illustrates the two possible outcomes of the analysis of the high resolution 
cell data. The camera images a sub-region 604 of the well area 603, producing images 

30 of the fluorescent cells In well C4, the uniform distribution of the fluorescence in 
the cells indicates that the receptor has not internalized, hnplying that the Ca"*^ response 
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seen was the result of the stimulation of some other signalling system in the cell. The 
cells in well E9 606 on the other hand, clearly indicate a concentration of the receptor 
in the perinuclear region clearly indicating the full activation of the receptor. Because 
only a few hit wells have to be analyzed with high resolution, the overall throughput of 
5 the dual mode system can he quite high, comparable to the high throughput system 
alone. 

Example 4 Kinetic High Content Screen 

The following is an example of a screen to measure the kinetics of 

10 internalization of a receptor. As described above, the stimulation of a GPCR, results in 
the internalization of the receptor, with a time course of about 15 min. Simply 
detecting the endpoint as internalized or not, may not be sufficient for defining the 
potency of a compound as a GPCR agonist or antagonist. However, 3 time points at 5 
min intervals would provide information not only about potency during the tune course 

15 of measurement, but would also allow extrapolation of the data to much longer time 
periods. To perform this assay, the sub-region would be defined as two rows, the 
sampling interval as 5 minutes and the total number of time points 3. The system 
/ would then start by scanning two rows, and then adding reagent to the two rows, 
establishing the time=0 reference. After reagent addition, the system would again scan 

20 the two row sub-region acquiring the first time point data. Since this process would 
take about 250 seconds, including scanning back to the beginning of the sub-region, the 
system would wait 50 seconds to begin acquisition of the second time point Two more 
cycles would produce the three time points and the system would move on to the 
second 2 row sub-region. The final two 2-row sub-regions would be scanned to finish 

25 all the wells on the plate, resulting in four time points for each well over the whole 
plate. Although the time points for the wells would be offset slightly relative to 
time=0, the spacing of the time points would be very close to the required 5 minutes, 
and the actual acquisition times and results recorded with much greater precision than 
in a fixed-cell screen. 

30 
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Example S High-content screen of human glucocorticoid receptor translocation 

One class of HCS involves the drug-induced dynamic redistribution of 
intracellular constituents. The human glucocorticoid receptor (hGR), a single "sensor" 
in the complex environmental response machinery of the cell, binds steroid molecules 
that have diffused into the cell. The ligand-receptor complex translocates to the 
nucleus where transcriptional activation occurs (Htun et al., Proc. Natl. Acad. Sci. 
93:4845,1996). 

hi general, hormone receptors are excellent drug targets because their activity 
lies at the apex of key intracellular signaling pathways. Therefore, a high-content 
screen of hGR translocation has distinct advantage over in vitro ligand-receptor binding 
assays. The availability of up to two more channels of fluorescence in the cell 
screening system of the present invention permits the screen to contain two additional 
parameters in parallel, such as other receptors, other distinct targets or other cellular 
processes. 

15 Plasmid construct. A eukaryotic expression plasmid containing a coding 

sequence for a green fluorescent protein - human glucocorticoid receptor (GFP-hGR) 
chimera was prepared using GFP mutants (Pahn et al., Nat. Struct. Biol. 4:361 (1997). 
The construct was used to transfect a human cervical carcinoma cell line (HeLa). 

<:ell preparation and transfection. HeLa ceUs (ATCC CCL-2) were trypsinized 

20 and plated using DMEM containing 5% charcoal/dextran-treated fetal bovine serum 
(FBS) (HyClone) and 1% penicilUn-streptomycin (C-DMEM) 12-24 hours prior to 
transfection and incubated at 37°C and 5% CO2 . Transfections were perfonned by 
calcium phosphate co-precipitation (Graham and Van der Eb, Virology 52:456, 1973; 
Sambrook et al., (1989)! Molecular Cloning: A Laboratory Manual, Second ed. Cold 

25 Spring Harbor Laboratory Press, Cold Spring Harbor, 1989) or with Lipofectamine (Life 
Technologies, Gaithersburg, MD). For the calcium phosphate transfections, the 
medium was replaced, prior to transfection, with DMEM containing 5% 
charcoaydextran-treated FBS. Cells were incubated with the calcium phosphate-DNA 
precipitate for 4-5 hours at 37°C and 5% CO2, washed 3-4 times with DMEM to 

30 remove the precipitate, followed by the addition of C-DMEM. 

Lipofectamine transfections were performed in serum-free DMEM without 
antibiotics according to the manufacturer's instructions (Life Technologies, 
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Gaithersburg, MD). Following a 2-3 hour incubation with the DNA-liposome 
complexes, the medium was removed and replaced with C-DMEM. All transfected 
cells in 96-well microliter plates were incubated at 33°C and 5% CO2 for 24-48 hours 
prior to drug treatment Experiments were performed with the receptor expressed 

5 transiently in HeLa cells. 

Dexamethasone induction of GFP-hGR translocation. To obtain receptor- 
ligand translocation kinetic data, nuclei of transfected cells were first labeled, with 5 
M-g/ml Hoechst 33342 (Molecular Probes) in C-DMEM for 20 minutes at 33**C and 5% 
CO2. Cells were washed once in Hank's Balanced Salt Solution (HBSS) followed by 

10 the addition of 100 nM dexamethasone in HBSS with 1% charcoal/dextran-treated 
FBS. To obtain fixed time point dexamethasone titration data, transfected HeLa cells 
were first washed with DMEM and then incubated at 33°C and 5% CO2 for 1 h in the 
presence of 0 - 1000 nM dexamethasone in DMEM containing 1% charcoal/dextran- 
treated FBS. Cells .were analyzed-live or they were rinsed with HBSS, fixed for* 15 min 

15 with 3.7% formaldehyde in HBSS, stEuned with Hoechst 33342, and washed before 
analysis. The intracellular GFP-hGR fluorescence signal was not diminished by this 
fixation procedure. 

Image acquisition and analysis. Kinetic data were collected by acquiring 
fluorescence image pairs (GFP-hGR and Hoechst 33342-la6eled nuclei) from fields of 

20 living cells at 1 min intervals for 30 min after the addition of dexamethasone. 
Likewise, image pairs were obtained firom each well of the fixed time point screening 
plates 1 h after the addition of dexamethasone. In both cases, the image pairs obtained 
at each time point were used to define nuclear and cytoplasmic regions in each cell. 
Translocation of GFP-hGR was calculated by dividing the integrated fluorescence 

25 intensity of GFP-hGR in the nucleus by the integrated fluorescence intensity of the 
chimera in the cytoplasm or as a nuclear-c3rtoplasmic difference of GFP fluorescence. 
In the fixed time point screen this translocation ratio was calculated firom data obtained 
from at least 200 cells at each concentration of dexamethasone tested. Drag-induced 
translocation of GFP-hGR from the c3^oplasm to the nucleus was therefore correlated 

30 with an increase in the translocation ratio. 

Results, Figure 20 schematically displays the drag-induced cytoplasm 253 to 
nucleus 252 translocation of the himian glucocorticoid receptor. The upper pair of 
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schematic diagrams depicts tlie localization of GFP-hGR within the cell before 25Q (A) 
and after 251 (B) stimulation with dexamethasone. Under these experimental 
conditions, the drug mduces a large portion of the cytoplasmic GFP-hGR to translocate 
into the nucleus. This redistribution is quantified by determining the integrated 

5 mtensities ratio of the cytoplasmic and nuclear fluorescence in treated 25i and 
untreated 254 cells. The lower pair of fluorescence micrographs show the dynamic 
redistribution of GFP-hGR in a single cell, before 254 and after 2^5 treatment. The 
HCS is performed on wells containing hundreds to thousands of transfected ceUs and 
the translocation is quantified for each ceU in the field exhibiting GFP fluorescence. 

10 Although the use of a stably transfected cell line would yield the most consistently 
labeled cells, the heterogeneous levels of GFP-hGR expression induced by transient 
transfection did not interfere with analysis by the cell screening system of the present 
invention. 

To execute the screen, the cell screening system scans each well of the plate, 

15 images a population of cells in each, and analyzes cells individually. Here, two 
channels of fluorescence are used to define the cytoplasmic and nuclear distribution of 
the GFP-hGR within each cell. Depicted in Figure 21 is the graphical user interface of 
the cell screening system near the end of a GFP-hGR screen. The user interface depicts 
the parallel data collection and analysis capabiUty of the system. The windows labeled 

20 "Nucleus" 261 and "GFP-hGR" 262 show the pair of fluorescence images being 
obtained and analyzed in a single field. The window labeled "Color Overlay" 260 is 
formed by pseudocoloring the above images and merging them so the user can 
immediately identify cellular changes. Within the "Stored Object Regions" window 
265, an image containing each analyzed ceU and its neighbors is presented as it is 

25 archived. Furthermore, as the HCS data are being collected, they are analyzed, in this 
case for GFP-hGR translocation, and translated into an immediate "hit" response. The 
96 well plate depicted in the lower window of the screen W. shows which wells have 
met a set of user-defined screening criteria. For example, a white-colored well 26£ 
indicates that the drag-induced translocation has exceeded a predetennined threshold 

30 value of 50%. On the other hand, a black-colored well 220 indicates that the drug being 
tested induced less than 10% translocation. Gray-colored wells 268 indicate "hits" 
where the translocation value fell between 10% and 50%. Row ••£" on the 96 well 

53 



BNSDOCID: <WO 0050B72A3JA> 



wo 00/50872 



PCT/USOO/04794 



plate being analyzed 266 shows a titration with a drug known to activate GFP-hGR 
translocation, dexamethasone. This example screen used only two fluorescence 
channels. Two additional channels (Channels 3 263 and 4 264) are available for 
parallel analysis of other specific targets, cell processes, or cytotoxicity to create 
multiple parameter screens. 

There is a linlc between tiie image database and the information database that is 
a powerful tool during the validation process of new screens. At the completion of a 
screen, the user has total access to image and calculated data (Figure 22). The 
comprehensive data analysis package of the cell screening system allows the user to 
examine HCS data at multiple levels. Images 22^ and detailed data in a spread sheet 
279 for individual cells can be viewed separately, or summary data can be plotted. For 
example, the calculated results of a single parameter for each cell in a 96 well plate are 
shown in the panel labeled Graph 1 275. By selecting a single point in the graph, the 
user can display the entire data set for a particular cell that is recalled firom an existing 
database. Shown here are the image pair 276 and detailed fluorescence and 
morphometric data from a single cell (Cell #118, gray line 277). The large graphical 
insert 278 shows the results of dexamethasone concentration on the translocation of 
GFP-hGR. Each point is the average of data from at least 200 cells. The calculated 
ECso for dexamethasone in this assay is 2 nM. 

A powerful, aspect of HCS with the cell screening system is the capability of 
kinetic measurements using multicolor fluorescence and morphometric parameters in 
living cells. Temporal and spatial measurements can be made on single cells within a 
population of cells in a field. Figure 23 shows kinetic data for the dexamethasone- 
induced translocation of GFP-hGR in several cells within a single field. Human HeLa 
cells transfected vvith GFP-hGR were treated with 100 nM dexamethasone and the 
translocation of GFP-hGR was measured over time in a population of single cells. The 
graph shows the response of transfected cells 285, 2M. m and 2S8 and non- 
transfected cells 28£. These data also illustrate the abiUty to analyze cells with 
diffarent expression levels. 
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Example 6 High-content screen of drug-induced apoptosis 

Apoptosis is a complex cellular program that involves myriad molecular events 
and pathways. To understand the mechanisms of drug action on this process, it is 
essential to measure as many of these events within cells as possible with temporal and 

5 spatial resolution. Therefore, an apoptosis screen that requires little cell sample 
preparation yet provides an automated readout of several apoptosis-related parameters 
would be ideal. A cell-based assay designed for the cell screening system has been 
used to simultaneously quantify several of tiie morphological, organellar, and 
macromolecular halhnarics of paclitaxel-induced apoptosis. 

10 Cell preparation. The cells chosen for this study were mouse connective tissue 

fibroblasts (L-929; ATCC CCL-1) and a highly invasive gUoblastoma cell line (SNB- 
19; ATCC CRL-2219) (Welch et al.. In Vitro Cell Dev. Biol 31:610, 1995). The day 
before treatment with an apoptosis inducing drug, 3500 cells were placed mto each well 
of a 96-well plate and incubated overnight at 37^.C in a humidified 5% CO2 

15 atmosphere. The following day, the culture medium was removed firom each well and 
replaced with fresh medium containing various concentrations of paclitaxel (0 - 50 
\}M) from a 20 mM stock made in DMSO. The maximal concentration of DMSO used 
in these experiments was 0.25%. The cells were then incubated for 26 h as above. At 
the end of the paclitaxel treatment period, each well received fresh medium containing 

20 750 nM MitoTracker Red (Molecular Probes; Eugene, OR) and 3 ^ig/ml Hoechst 33342 
DNA-binding dye (Molecular Probes) and was incubated as above for 20 min. Each 
well on the plate was then washed with HBSS and fixed with 3.7% foraialdehyde in 
HBSS for 15 min at room temperature. The formaldehyde was washed out with HBSS 
and the cells were peraieabiUzed for 90 s with 0.5% (v/v) Triton X-100, washed with 

25 HBSS, incubated with 2 U ml'^ Bodipy FL phallacidin (Molecular Probes) for 30 min, 
and washed with HBSS. The wells on the plate were then filled with 200 \xX HBSS, 
sealed, and the plate stored at 4'*C if necessary. The fluorescence signals from plates 
stored this way were stable for at least two weeks after preparation. As in the nuclear 
translocation assay, fluorescence reagents can be designed to convert this assay into a 

30 live cell high-content screen. 

Image acquisition and analysis on the ArrayScan System, The fluorescence 
mtensity of intracellular MitoTracker Red, Hoechst 33342, and Bodipy FL phallacidin 
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was measured with the cell screening system as described stq)ra. Moiphometric data 
from each pair of images obtained from each well was also obtained to detect each 
object in the image field {e.g.^ cells and nuclei), and to calculate its size, shape, and 
integrated intensity. 

5 Calculations and output A total of 50-250 cells were measured per image 

field. For each field of cells, the following calculations were performed: (1) The 
average nuclear area (jim^) was calculated by dividing the total nuclear area in a field 
by the number of nuclei detected. (2) The average nuclear perimeter (]im) was 
calculated by dividing the sum of the perimeters of all nuclei in a field by the number 

10 of nuclei detected in that field. Highly convoluted apoptotic nuclei had the largest 
nuclear perimeter values. (3) The average nuclear brightness was calculated by dividing 
the integrated intensity of the entire field of nuclei by the number of nuclei in that field. 
An increase in nuclear brightness was correlated with increased DNA content. (4) The 
average cellular brightness was calculated by dividing- the integrated mtensity of. an 

15 entire field of cells stained with MitoTracker dye by the number of nuclei in that field. 
Because the amount of MitoTracker dye that accumulates withm the mitochondria is 
proportional to the mitochondrial potential, an increase in fhe average cell brightness is 
consistent with an increase in mitochondrial potential. (5) The average cellular 
brightness was also calculated by dividing the integrated mtensity of an entire field of 

20 cells stained with Bodipy FL phallacidin dye by the number of nuclei in that field. 
Because the phallotoxins bind with high affinity to the polymerized form of actin, the 
amount of Bodipy FL phallacidin dye that accumulates within the cell is proportional to 
actin polymerization state. An increase in the average cell brightness is consistent with 
an increase in actin polymerization. 

25 Results. Figure 24 (top panels) shows the changes paclitaxel uiduced in the 

nuclear morphology of L-929 cells. Lacreasing amounts of paclitaxel caused nuclei to 
enlarge and fragment 293, a hallmark of apoptosis. Quantitative analysis of these and 
other images obtained by the cell screening system is presented in the same figure. 
Each parameter measured showed that the L-929 cells 296 were less sensitive to low 

30 concentrations of paclitaxel than were SNB-19 cells 227. At higher concentrations 
though, the L-929 cells showed a response for each parameter measured. The 
multiparameter approach of this assay is usefiil in dissecting the mechanisms of drag 
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action. For example, the area, brightness, and fragmentation of the nucleus 228 and 
actin polymerization values 224 reached a maximum value when SNB-19 cells were 
treated with 10 nM pacUtaxel (Figure 24; top and bottom graphs). However, 
mitochondrial potential 295 was minimal at the same concentration of paclitaxel 

5 (Figure 24; middle graph). The fact that aU the parameters measured approached 
control levels at increasing pacUtaxel concentrations (>10 nM) suggests that SNB-19 
cells have low afBnity drug metaboUc or clearance pathways that are compensatory at 
sufficiently high levels of the drag. Contrasting the drug sensitivity of SNB-19 cells 
297. L-929 showed a different response to paclitaxel 226. These fibroblastic ceUs 

10 showed a maximal response in many parameters at 5 jiM paclitaxel, a 500-fold Wgher 
dose than SNB-19 cells. Furthennore, the L-929 cells did not show a sharp decrease in 
mitochondrial potential 295 at any of the paclitaxel concentrations tested. This result is 
consistent with the presence of unique apoptosis pathways between a normal and 
cancer cell Une. Hierefore, these results indicate that a relatively simple fluorescence 

15 labeling protocol can be coupled with the cell screening system of the present invention 
to produce a high-content screen of key events involved in programmed ceU death. 

Background 

A key to the mechanism of apoptosis .was the discovery that, irrespective of the 
20 lethal stimulus, death results in identical apoptotic morphology that includes cell and 
organelle dismantling and repackaging. DNA cleavage to nucleosome sized fragments, 
and engulfinent of the fragmented cell to avoid an inflammatory response. Apoptosis is 
therefore distmct from necrosis, which is mediated more by acute trauma to a cell, 
resulting in spillage of potentially toxic and antigenic ceUular components into the 
25 intercellular milieu, leading to an mflammatory response. 

The criteria for determining whether a cell is undergoing apoptosis (Wyllie et 
al. 1980. Int Rev Cytol 68:251-306; Thompson, 1995. Science. 267:1456-62; Majno 
and Joris. 1995. Am J Pathol. 146:3-15; Allen et al. 1998. Cell Mol Life Sci. 54:427-45) 
include distinct morphological changes in the appearance of the cell, as well as 
30 • alterations in biochemical and molecular markers. For example, apoptotic cells often 
undergo cytoplasmic membrane blebbing, their chromosomes r^idly condense and 
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aggregate around the nuclear periphery, the nucleus fragments, and small apoptotic 
bodies are formed. In many, but not all, apoptotic cells, chromatin becomes a target for 
specific nucleases that cleave the DNA. 

Apoptosis is commonly accompanied by a characteristic change in nuclear 

5 morphology (chromatin condensation or fragmentation) and a step-wise fragmentation 
of DNA culminating m the formation of mono- and/or oligomeric fragments of 200 
base pairs. Specific changes in organellar fimction, such as mitochondrial membrane 
potential, occur. In addition, specific cysteine proteases (caspases) are activated, which 
catalyzes a highly selective pattern of protein degradation by proteolytic cleavage after 

10 specific aspartic acid residues. In addition, the external surface exposure of 
phosphatidylserine residues (normally on the inner membrane leaflet) allows for the 
recognition and elimination of apoptotic cells, before the membrane breaks up and 
cytosol or organeUes spill into the intercellular space and elicit inflammatoiy reactions. 
Moreover, cells undergoing apoptosis tend to shrink, while also -having a reduced 

15 intracellular potassium level. 

The general patterns of apoptotic signals are very similar among different cell types 
and apoptotic inducers. However, the details of tiie pathways actijally vary significantly 
depending on cell type and inducer. The dependence and independence of various signal 
transduction pathways involved in apoptosis are currentiy topics of intense research. We 

20 show here ttiat the pathway also varies depending upon tiie dose of tiie inducer in specific 
cell types. 

Nuclear Morphology 

Cells undergoing apoptosis generally exhibit two types of nuclear change, 
25 fragmentation or condensation ((Majno and Joris, 1995), (Eamshaw, 1995)). The 

response in a given cell type appears to vary dependmg on the apoptotic inducer. 

During nuclear fragmentation, a circular or oval nucleus becomes increasingly lobular. 

Eventually, tiie nucleus fragments dramatically into multiple sub-nuclei. Sometimes the 

density of the chromatin within tiie lobular nucleus may show spatial variations in 
30 distaibution (heterochromatization), approximatmg tiie margination seen in nuclear 

condensation. 
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Nuclear condensation has been reported in some cell types, such as MCF-7 
(Saunders et al. 1997. Int J Cancer. 70:214-20). Condensation appears to arise as a 
consequence of the loss of structural integrity of the euchromatin, nuclear matrix and 
nuclear lamina (Hendzel et al. 1998. J Biol Chem. 273:24470-8). During nuclear 
5 condensation, the chromatin concentrates near the margin of the nucleus, leading to the 
overall shrinkage of the nucleus. Thus, the use of nuclear morphology as a measure of 
apoptosis must take both condensation and fragmentation into account 

Material and Methods 

10 Cells were plated mto 96-well plates at densities of 3 x 10^ to 1 x lO" cells/weU. 

The following day apoptotic inducers were added at indicated concentrations and cells 
were incubated for indicated time periods (usually 16-30 hours). The next day medium 
was removed and cells were stained with 5 |J.g/ml Hoechst (Molecular Probes, Inc.) in 
fresh medium and inculjated for 30 minutes at 37°C. Cells were washed in Hank's 

15 Balanced Salt Solution (HBSS) and fixed with 3.7% formaldehyde in HBSS at room 
temperature. Cells were washed 2X with HBSS at room temperature and the plate was 
sealed. 

Quantitation of changes in nuclear morphology upon induction of apoptosis was 
accomplished by (1) measuring the effective size- of the nuclear region; and (2) 
20 measuring the degree of convolution of the perimeter. The size parameter provides the 
more sensitive measure of nuclear condensation, whereas the perimeter measure 
provides a more sensitive measure of nuclear fragmentation. 

Results & Discussion 

25 L929 cells responded to both staurosporine (30 hours) and paclitaxel (30 hours) 

with a dose-dependent change in nuclear morphology (Fig 25A and 25B). BHK cells 
iUustrated a slightly more compUcated, yet clearly visible response. Staurosporine 
appeared to stimulate nuclear condensation at lower doses and nuclear fragmentation at 
higher doses (Fig 25C and 25D). In contrast, paclitaxel induced a consistent increase in 

30 nuclear fragmentation with increasing concentrations. The response of MCF-7 cells 
varied dramatically depending upon the apoptotic inducer. Staurosporine appeared to 
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elicit nuclear condensation whereas paclitaxel induced nuclear fragmentation (Fig 25E 
and25F), 

Figure 26 illustrates the dose response of cells in terms of both nuclear size and 
nuclear perimeter convolution. There appears to be a swelling of the nuclei that 
5 precedes the fragmentation. 

Result of evaluation: Differential responses by cell lines and by apoptotic 
inducers were observed in a dose dependent maimer, indicating that this assay will be 
useful for detecting changes in the nucleus characteristic of apoptosis. 

10 Actin reorganization 

We assessed changes in the actin cytoskeleton as a potential parameter related 
to apoptotic changes. This was based on preluninary observations of an early increase 
in f-actin content detected with fluorescent phalloidin labeling, an f-actin specific stain 
(our unpubhshed data; Levee et al. 1996. Am JPhysioL 111 :C1981-92; Maekawa et al. 

15 1996. Clin Exp Immwtol, 105:389-96). Changes in the actin cytoskeleton during 
apoptosis have not been observed in all cell types, (Endresen et al. 1995. Cytometry. 
20:162-71, vanEngeland et al. 1997. Exp Cell Res. 235:421-30). 
Material and Methods 

Cells were plated in 96-well plates at densities of 3 x 10^ to 1 x lO"^ cells/well. 

20 The following day apoptotic inducers were added at indicated concentrations. Cells 
were incubated for the indicated tune periods (usually 16-30 hours). The next day the 
medixun was removed and cells were stained with 5 |ag/ml Hoechst (Molecular Probes, 
Inc.) in fresh medium and incubated for 30 minutes at 30°C. Cells were washed in 
HBSS and fixed with 3.7% formaldehyde in HBSS at room temperature. Plates were 

25 washed with HBSS and permeabilized with 0.5% v/v Triton X-100 in HBSS at room 
temperature. Plates were washed in HBSS and stained with 100 (il of lU/ml of Alexa 
488 Phalloidin stock (100 |il/well. Molecular Probes, Lie). Cells were washed 2X with 
HBSS at RT and the plate was sealed. 

Quantitation of f-actin content was accompUshed by measuring the intensity of 

30 phalloidin staining around the nucleus. This was determined to be a reasonable 

approximation of a full cytoplasnuc average of the intensity. The mask used to 

approximate this cj^oplasmic measure was derived from the nuclear mask defined by 
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the Hoechst stain. Derivation was accomplished by combinations of erosions and 
dilations. 

Results and Discussion 

5 Changes in f-actin content varied based on ceU type and apoptotic inducer (Fig 

27). Staurosporine (30 hours) induced increases in f-actin in L929 (Fig. 27 A) and BHK 
(Fig. 27B) cells. MCF-7 cells exhibited a concaitration-dependent response. At low 
concentrations (Fig. 27E) there appeared to be a decrease in f-actin content. At higher 
concentrations, f-actin content increased. PacUtaxel (30 hours) treatment led to a wide 

10 variety of response's. L929 ceUs responded with graded increases in f-actin (Fig. 27B) 
whereas both BHK and MCF-7 responses were highly variable (Figs. 27D & 27F, 
respectively). 

Result of Evaluation: Both increases and decreases in signal mtensity were 
15 measured for several ceU hnes and found to exhibit a concentration dependent 
response. For certain cell Une/apoptotic inducer pairs this could be a statistically 
significant apoptotic indicator. 

Changes in Mitochondrial Mass/Potential 
20 Introduction 

Changes in mitochondria play a central role in apoptosis (Henkart and 
Grinstein. 1996. J Exp Med. 183:1293-5). Mitochondria release apoptogenic factors 
through the outer membrane and dissipate the electrochemical gradient of the inner 
membrane. This is thought to occur via formation of the mitochondria permeability 

25 transition (MPT), although it is apparently not true in all cases. An obvious 
manifestation of the formation of the MPT is collapse of the mitochondrial membrane 
potential. Inhibition of MPT by pharmacological mtervention or mitochondrial 
expression of the anti-apoptotic protein Bcl-2 prevents cell death, suggesting the 
formation of the MPT may be a rate-limiting event of the death process (For review 

30 see: Kroemer et al. 1998. Annu Rev Physiol. 60:619-42). It has also been observed that 
mitochondria can proliferate during stimulation of apoptosis (Mancini et al. 1997. J 
Cell Biol. 138:449-69; Camilleri-Broet et al. l99Z._Exp Cell Res. 239:277-92). 
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One approach for measuring apoptosis-induced changes in mitochondria is to 
measure the mitochondrial membrane potential. Of the methods available, the simplest 
measure is flie redistribution of a cationic dye that distributes within intracellular 
organelles based on the membrane potential. Such an approach traditionally requires 

5 live cells for the measurements. The recent introduction of the MitoTracker dyes (Foot 
et al. 1997. Cytometry. 27:358-64; available from Molecular Probes, Inc., Oregon) 
provides a means of measuring mitochondrial membrane potential after fixation. 

Given the observations of a possible increase in mitochondrial mass dining 
apoptosis, the amount of dye labeling the mitochondria is related to both membrane 

10 potential and the number of mitochondria. If the number of mitochondria remains 
constant then the amount of dye is directly related to the membrane potential. If the 
number of mitochondria is not constant, then the signal will likely be dominated by the 
increase in mass (Reipert et al, 1995. Exp Cell Res. 221:281-8), 

Probes are available that allow a clear separation between changes in mass and 

15 potential in HCS assays. Mitochondrial mass is measured directly by labeling with 
Mitotracker Green FM (Poot and Pierce, 1999, Cytometry. 35:311-7; available from 
Molecular Probes, Inc., Oregon). The labeling is independent of mitochondrial 
membrane potential but proportional to mitochondrial mass. This also provides a 
means of normalizing other mitochondrial measures in each cell with respect to 

20 mitochondrial mass. 

Material and Methods 

Cells were plated into 96-well plates at densities of 3 x 10^ to 1x10"^ cells/welL 
The following day apoptotic inducers were added at the indicated concentrations and 

25 cells were incubated for the indicated time periods (usually 16-30 hours). Cells were 
• stained with 5 \xg/ml Hoechst (Molecular Probes, Inc.) and 750 nM MitoTracker Red 
(CMXRos, Molecular Probes, Inc.) in fresh medium and incubated for 30 minutes at 
37^C. Cells were washed in HBSS and fixed with 3.7% formaldehyde in HBSS at room 
temperature. Plates* were washed with HBSS and peimeabilized with 0.5% v/v Triton 

30 X-100 in HBSS at room temperature. Cells were washed 2X with HBSS at room 
temperature and the plate was sealed. For dual labeling of mitochondria, cells were 
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treated with 200 nM Mitotracker Greeai and 200 nM Mitotracker Red for 0.5 hours 
before fixation. 

Results & Discussion 

5 Induction of apoptosis by staurosporine and paclitaxel led to varying 

mitochondrial changes depending upon the stimulus. L929 cells exhibited a clear 
increase in mitochondrial mass with increasing staurosporine concentrations (Fig. 28). 
BHK cells exhibited either a decrease in membrane potential at lower concentrations of 
staurosporine, or an increase in mass at higher concentrations of staurosporine (Fig. 

10 28C). MCF-7 cells responded by a consistent decrease in mitochondrial membrane 
potential in response to increasing concentrations of staurosporine (Fig 28E). 
Increasing concentrations of paclitaxel caused consistent increases in mitochondrial 
mass (Fig 28B, 28D. and 28F). 

The mitochondrial membrane potential is measured by labeling mitochondria 

15 with both Mitotracker Green FM and Mitotracker Red (Molecular Probes, Inc). 
Mitotracker Red labeling is proportional to both mass and membrane potential. 
Mitotracker Green FM labeling is proportional to mass. The ratio of Mitotracker Red 
signal to the Mitotracker Green FM signal provides a measure of mitochondrial 
memSrane potential (Foot and Pierce, 1999). This ratio normalizes the mitochondrial 

20 mass with respect to the Mitotracker Red signal. (See Figure 28G) Combining the 
ability to normalize to mitochondrial mass with a measure of the membrane potential 
allows independent assessment of both parameters. 

Result of Evaluation: Both decreases in potential and increases in mass were observed 
25 depending on the cell hne and inducer tested. Dose dependent correlation demonstrates 

that this is a promising apoptotic indicator. 

It is possible to combine multiple measures of apoptosis by e:q)loiting the 

spectral domain of fluorescence spectroscopy. In fact, all of the nuclear morphology/f- 

actm content/mitochondrial mass/mitochondrial potential data shown earlier were 
30 collected as multiparameter assays, but were presented individually for clarity. 
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Example 7. Protease induced translocation of a signaling enzyme containing a 
disease-associated sequence from cytoplasm to nucleus. 

Plasmid construct A eukaryotic expression plasmid containing a coding 
5 sequence for a green fluorescent protein - caspase (Cohen (1997), Biochemical J. 
326:1-16; Liang et al. (1997), J. ofMolec. Biol 274:291-302) chimera is prepared using 
GFP mutants. The cpnstract is used to transfect eukaryotic cells. 

Cell preparation and transfection. Cells are trypsinized and plated 24 h prior 
to transfection and incubated at 3TC and 5% CO2. Transfections are performed by 
10 methods including, but not limited to calcium phosphate coprecipitation or lipofection. 
Cells are incubated with the calcium phosphate-DNA precipitate for 4-5 hours at 37°C 
and 5% CO2, washed 3-4 times with DMEM to remove the precipitate, followed by the 
addition of C-DMEM. Lipofectamine transfections are performed in serum-free 
DMEM without antibiotics according to the manufacturer's instructions. Following a 
15 2-3 hour incubation with the DNA-liposome complexes, the medium is removed and 
replaced with C-DMEM. 

Apopototic induction of Caspase-GFP translocation. To obtain Caspase-GFP 
translocation kinetic data, nuclei of transfected cells are first labeled with 5 }ig/ml 
Hoechst 33342 (Molecular Probes) in €-DMEM for 20 minutes at 37**C and 5% CO2. 
20 Cells are washed once in Hank's Balanced Salt Solution (EIBSS) followed by the 
addition of compounds that induce apoptosis. These compounds include, but are not 
limited to paclitaxel, staurosporine, ceramide, and tumor necrosis factor. To obtain 
fixed time point titration data, transfected cells are first washed with DMEM and then 
incubated at 37°C and 5% CO2 for 1 h in the presence of 0 - 1000 nM compound in 
25 DMEM. Cells are analyzed live or they are rinsed with HBSS, fixed for 15 min with 
3.7% formaldehyde m HBSS, stained with Hoechst 33342, and washed before analysis. 

Image acquisition and analysis. Kinetic data are collected by acquiring 
fluorescence image pairs (Caspase-GFP and Hoechst 33342-labeled nuclei) from fields 
of living cells at 1 min intervals for 30 min after the addition of compound. Likewise, 
30 image pairs are obtained from each well of the fixed time pomt screening plates 1 h 
after the addition of compound. In both cases, the image pairs obtained at each time 
point are used to define nuclear and cytoplasmic regions in each cell. Translocation of 
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Caspase-GFP is calculated by dividing the integrated fluorescence intensity of Caspase- 
GFP in the nucleus by the integrated fluorescence intensity of the chimera in the 
cytoplasm or as a nuclear-cytoplasmic difference of GFP fluorescence. In the fixed 
time point screen this translocation ratio is calculated from data obtained from at least 

5 200 cells at each concentration of compound tested. Drug-induced translocation of 
Caspase-GFP from the cytoplasm to the nucleus is therefore correlated with an increase 
in the translocation ratio. Molecular interaction libraries including, but not limited to 
those comprising putative activators or inhibitors of apoptosis-activated enzymes are 
use to screen the indicator cell lines and identify a specific ligand for the DAS, and a 

10 pathway activated by compound activity. 

Example 8. Identification of novel steroid receptors from DAS 

Two sources of material and/or information are required to make use of this 
embodiment, which allows assessment of the function of an tmcharacterized gene. 

15 First, disease associated sequence bank(s) cont a ining cDNA sequences suitable for 
transfection into mammalian cells can be used. Because every RADE or differential 
expression experiment generates up to several hundred sequences, it is possible to 
generate an ample supply of DAS. Second, information from primary sequence 
database searches can be used to place D^ into broad categories, including, but not 

20 limited to, those that contain signal sequences, seven trans-membrane motifs, 
conserved protease active site domains, or other identifiable motifs. Based on the 
information acquired from these sources, method types and indicator cell lines to be 
transfected are selected. A large number of motifs are already well characterized and 
encoded in the Unear sequences contained within the large number genes in existing 

25 genomic databases. 

In one embodiment, the foUowuig steps are taken: 

1) Information from the DAS identification experiment (including database 
searches) is used as the basis for selecting the relevant biological processes, (for 
example, look at the DAS from a tumor line for cell cycle modulation, apoptosis, 

30 metastatic proteases, etc.) 

2) Sorting of DNA sequences or DAS by identifiable motifs (ie. signal 
sequences, 7-, transmembrane domains, conserved protease active site domains, etc.) 
This initial grouping will detennine fluorescent tagging strategies, host cell lines, 
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indicator cell lines, and banks of bioactive molecules to be screened, as described 
supra. 

3) Using well established molecular biology methods, ligate DAS into an 
expression vector designed for this purpose. Generalized expression vectors contain 
promoters, enhancers, and terminators for which to deliver target sequences to the cell 
for transient expression. Such vectors may also contain antibody tagging sequences, 
direct association sequences, chromophore fusion sequences like GFP, etc. to facilitate 
detection when expressed by the host. 

4) Transiently transfect cells with DAS containing vectors using standard 
transfection protocols includuxg: calcium phosphate co-precipitation, liposome 
mediated, DEAE dextran mediated, polycationic mediated, viral mediated, or 
electroporation, and plate into microtiter plates or microwell arrays. Alternatively, 
transfection can be done directly in the microtiter plate itself. 

5) Carry out the cell screening methods as described supra. 

In this embodiment, DAS. shown to possess a motif(s) suggestive of 
transcriptional activation potential (for example, DNA bmding domam, ammo terminal 
modulating domain, hinge region, or carboxy teraiinal ligand binding domain) are 
utilized to identify novel steroid receptors. 

Defining the fluorescent tags for this experiment involves identification of the 
nucleus through ttainmg, and taggmg the DAS by creating a GFP chhnera via insertion 
of DAS into an expression vector, proximally fiised to the gene encoding GFP. 
Alternatively, a single chain antibody fragment with high affinity to some portion of the 
expressed DAS could be constructed using technology available in the art (Cambridge 
Antibody Technologies) and linked to a fluorophore (FTTC) to tag the putative 
transcriptional activator/receptor in the cells. This alternative would provide an 
external tag requiring no DNA transfection and therefore would be useful if distribution 
data were to be gathered firom the original primary cultures used to generate the DAS. 

Plasmid construct. A eukaryotic expression plasmid containing a coding 
sequence for a green fluorescent protein - DAS chimera is prepared using GFP 
mutants. The constmct is used to transfect HeLa cells. The plasmid, when transfected 
into the host cell, produces a GFP fused to the DAS protein product, designated GFP- 
DASpp. 
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Cell preparation and transfection. HeLa cells arie trypsinized and plated using 
DMEM containing 5% charcoal/dextran-treated fetal bovine serum (FBS) (Hyclone) 
and 1% penicillin-streptoniycin (C-DMEM) 12-24 hours prior to transfection and 
incubated at 37°C and 5% CO2 . Transfections are performed by calcium phosphate 

5 coprecipitation or with Lipofectamine (Life Technologies). For the calcium phosphate 
transfections, the medium is replaced, prior to transfection, with DMEM containing 5% 
charcoal/dextran-treated FBS. Cells are incubated with the calcium phosphate-DNA 
precipitate for 4-5 hours at 37°C and 5% CO2, and washed 3-4 times with DMEM to 
remove the precipitate, followed by the addition of C-DMEM. Lipofectamine 

10 transfections are performed in serum-free DMEM without antibiotics according to the 
manufacturer's instractions. Following a 2-3 hour incubation with the DNA-Iiposome 
complexes, the medium is removed and replaced with C-DMEM. All transfected cells 
in 96-well microtiter plates are incubated at 33**C and 5% CO2 for 24-48 hours prior to 
drug treatment Experiments are performed with the receptor expressed transiently in 

15 HeLa cells. 

Localization of expressed GFP-DASpp inside cells. To obtain cellular 
distribution data, nuclei of transfected cells are first labeled with 5 jig/ml Hoechst 
33342 (Molecular Probes) in C-DMEM for 20 minutes at 33*'C and 5% CO2. Cells are 
washed once in Hank's Balanced Salt Solution (HBSS). The cells are analyzed live or 
20 they are rinsed with HBSS, fixed for 15 min with 3.7% formaldehyde in HBSS, stained 
with Hoechst 33342, and washed before analysis. 

In a preferred embodiment, image acquisition and analysis are performed using 
the cell screening, system of the present invention. The intracellular GFP-DASpp 
fluorescence signal is collected by acquiring fluorescence image pairs (GFP-DASpp 
25 and Hoechst 33342-labeled nuclei) from field cells. The image pairs obtained at each 
time point are used to define nuclear and cytoplasmic regions in each cell. Data 
demonstrating dispersed signal in the cytoplasm would be consistent with known 
steroid receptors that axe DNA transcriptional activators. 

Screening for induction of GFP-DASpp translocation. Using the above 
30 construct, confirmed for appropriate expression of the GFP-DASpp, as an indicator cell 
line, a screen of various ligands is performed using a series of steroid type ligands 
including, but not limited to: estrogen, progesterone, retinoids, growth factors, 
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androgens, and many other steroid and steroid based molecules. Image acquisition ana 
analysis are perfonned using the cell screening system of the invention. The 
intracellular GFP-DASpp fluorescence signal is collected by acquiring fluorescence 
image pairs (GFP-DASpp and Hoechst 33342-labeled nuclei) from fields cells. The 

5 image pairs obtained at each time point are used to define nuclear and cytoplasmic 
regions in each cell. Translocation of GFP-DASpp is calculated by dividing the 
integrated fluorescence intensity of GFP-DASpp in the nucleus by the integrated 
fluorescence intensity of the chimera in the cytoplasm or as a nuclear-cytoplasmic 
difference of GFP fluorescence. A translocation from the cytoplasm into the nucleus 

10 indicates a Ugand binding activation of the DASpp thus identifying the potential 
receptor class and action. Combining this data with other data obtained in a sunilar 
fashion using known inhibitors and modifiers of steroid receptors, would either validate 
the DASpp as a target, or more data would be generated fi-om various sources. 

15 Example 9 Additional Screens 

Translocation between the plasma membrane and the cytoplasm: 

Profilactin complex dissociation and binding of proiilin to the plasma 
membrane. In one embodiment,' a fluorescent protein biosensor of profilin membrane 
binding is prepared by labeling purified profilin (Federov et al.(1994), J. Molec. Biol 

20 241:480-482; Lanbrechts et al. (1995), Eur. J. Biochem. 230:281-286) with a probe 
possessing a fluorescence lifetime in the range of 2-300 ns. The labeled profilin is 
introduced into living indicator cells using bulk loading methodology and the mdicator 
cells are treated with test compounds. Fluorescaice anisotropy imagmg microscopy 
(Gough and Taylor (1993), J. Cell Biol 121:1095-1107) is used to measure test- 

25 compound dependent movement of the fluorescent derivative of profilin between the 
cytoplasm and membrane for a period of time after treatment ranging from 0.1 s to 10 
h. 

Rho-RhoGDI complex translocation to the membrane. In another 

embodiment, indicator cells are treated with test compounds and then fixed, washed, 

30 and permeabilized. The indicator cell plasma membrane, cytoplasm, and nucleus are 

all labeled with distinctly colored markers followed by immunolocalization of Rho 

protein (Self et al. (1995), Methods in Enzyniology 256:3-10; Tanaka et al. (1995), 
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Methods in Enzymology 256:41-49) with antibodies labeled with a foitrth color. Each 
of the four labels is imaged separately using the cell screening system, and the images 
used to calculate the amount of inhibition or activation of translocation effected by the 
test compound. To do this calculation, the images of the probes used to mark the 

5 plasma membrane and cytoplasm are used to mask the image of the immunological 
probe marking the location of mtracellular Rho protein. The integrated brightness per 
unit area under each mask is used to form a translocation quotient by dividing the 
plasma membrane mtegrated brightness/area by the cytoplasmic integrated 
brightness/area. By comparing the translocation quotient values from control and 

10 experimental wells, the percent translocation is calculated for each potential lead 
compound. 

^-Arrestin translocation to the plasma membrane upon G-protein receptor activation. 

In another embodiment of a cytoplasm to membrane translocation high-content 

15 screen, the translocation of p-airestin protein from the cytoplasm to the plasma 
membrane is measured in response to cell treatment. To measure the translocation, 
living indicator cells containing luminescent domain markers are treated with test 
compounds and the movement of the ^.-arrestin marker is measured m time and space 
using the cell screening system of the present invention. In a preferred embodiment, 

20 the indicator cells contain luminescMit markers consisting of a green fluorescent protein 
(5-arrestin (GFP-p-airestin) protem chimera (Barak et al. (1997), J. Biol. Chem. 
272:27497-27500; Daaka et al. (1998), J. Biol. Chem. 273:685-688) that is expressed 
by the indicator cells through the use of transient or stable cell transfection and othra- 
reporters used to mark cytoplasmic and membrane domains. When the indicator cells 

25 are in the resting state, the domain marker molecules partition predominately in the 
plasma membrane or in the cytoplasm. In the high-content screen, these markers are 
used to delineate the cell cytoplasm and plasma membrane in distmct channels of 
fluorescence. When the indicator cells are treated with a test compound, the dynamic 
redistribution of the GFP-p-arrestin is recorded as a series of images over a time scale 

30 ranging from 0.1 s to 10 h. In a preferred embodiment, the time scale is 1 h. Each 
image is analyzed by a method that quantifies the movement of the GFP-p-arrestin 
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protein chimera between the plasma membrane and the cj^oplasm. To do this 
calculations the images of the probes used to mark the plasma membrane and cytoplasm 
are used to mask the image of the GFP-P-arrestin probe marking the location of 
intracellular GFP-P-arrestin protein. The integrated brightness per unit area imder each 

5 mask is used to form a translocation quotient by dividing the plasma membrane 
integrated brightness/area by the cytoplasmic integrated brightness/area* By comparing 
the translocation quotient values from control and experimental wells, the perceat 
translocation is calculated for each potential lead compound. The output of the high- 
content screen relates quantitative data describing the magnitude of the translocation 

10 within a large number of individual cells that have been treated with test compounds of 
interest. 

Translocation between the endoplasmic reticulum and the Golgi: 

In one embodiment of an endoplasmic reticulum to Golgi translocation high- 
content screen, the translocation of a VSVG protein from the ts045 mutant strain of 

15 vesicular stomatitis virus (EUenberg et al. (1997), J. Cell Biol 138:1193-1206; Presley 
et al (1997) Nature 389:81-85) from the endoplasmic reticulum to the Golgi domain is 
measured in response to cell treatment To measure the translocation, indicator cells 
containing luminescent reporters are treated with test compounds and the movement of 
the reporters is measured in space and time using the cell screening system of the 

20 present invention. The indicator cells contain luminescent reporters consisting of a 
GFP-VSVG protein chimera that is expressed by the indicator cell through the use of 
transient or stable cell transfection and other domain markers used to measure the 
localization of the endoplasmic reticulum and Golgi domains. When the indicator cells 
are in their resting state at 40*^0, the GFP-VSVG protem chimera molecules are 

25 partitioned predominately in the endoplasmic reticulum, hi this high-content screen, 
domain markers of distinct colors used to delineate the endoplasmic reticulum and the 
Golgi domains in distinct channels of fluorescence. When the indicator cells are treated 
with a test compound and the temperature is simultaneously lowered to 32'*C, the 
dynamic redistribution of the GFP-VSVG protein chimera is recorded as a series of 

30 images over a time scale ranging from 0.1 s to 10 h. Each image is analyzed by a 
method that quantifies the movement of the GFP-VSVG protein chimera between the 
endoplasmic reticulum and the Golgi domains. To do this calculation, the images of 
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the probes used to mark the endoplasmic reticulum and the Golgi domains are used to 
mask the image of the GFP-VSVG probe marking the location of intracellular GFP- 
VS VG protein. The integrated brightness per unit area under each mask is used to form 
a translocation quotient by dividing the endoplasmic reticulum integrated 

5 brightness/area by the Golgi integrated brightness/area. By comparing the translocation 
quotient values from control and experimental wells, the percent translocation is 
calculated for each potential lead compound. The output of the high-content screen 
relates quantitative data describing the magnitude of the translocation within a large 
number of individual cells that have been treated with test compounds of interest at 

10 final concentrations ranging from 10"^^ M to 10"^ M for a period ranging from 1 min to 
10 h. 

Induction and inhibition of organella}' function: 
Intracellular microtubule stability. 

15 In another aspect of the invention, an automated method for identifying 

compounds that modify microtubule structure is provided. In this embodiment, 
indicator cells are treated with test compounds and the distribution of luminescent 
microtubule-labeling molecules is measured in space and time using a cell screening 
system, such as the one disclosed above. The luminescent microtubule-labeling 

20 molecules may be expressed by or added to the cells either before, together with, or 
after contacting the cells with a test compound. 

In one embodiment of this aspect of the invention, living cells express a 
iuminescently labeled protein biosensor of microtubule dynamics, comprising a protein 
that labels microtubules fused to a lummescent protein. Appropriate microtubule- 

25 labeling proteins for this aspect of the invention include, but are not limited to a and p 
tubulin isoforms, and MAP4. Preferred embodiments of the luminescent protein 
include, but are not limited to green fluorescent protein (GFP) and OF? mutants. In a 
preferred embodiment, the method involves transfecting cells with a microtubule 
labeling luminescent protein, wherein the microtubule labeling protein can be, but is 

30 not limited to, a-tubulin, ^-tubulin, or microtubule-associated protein 4 (MAP4). The 
approach outlined here enables those skilled in the art to make live cell measurements 
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to determine the effect of lead compounds on tubulin activity and microtubule stability 
iji vivo. 

In a most preferred embodiment, MAP4 is fused to a modified version of the 
Aequorea victoria green fluorescent protein (GFP). A DNA constmct has been made 
5 w^hich consists of a fusion between the EGFP coding sequence (available from 
Clontech) and the coding sequence for mouse MAP4. (Olson et al., (1995), J. Cell 
Biol. 130(3): 639-650). MAP4 is a ubiquitous microtubule-associated protein that is 
known to interact with microtubules in interphase as well as mitotic cells (Olmsted and 
Murofiishi, (1993), MAP4. In "Guidebook to tiie Cytoskeleton and Motor Proteins." 
10 Oxford University Press. T. Kreis and R. Vale, eds.) Its localization, then, can serve as 
an indicator of the localization, organization, and integrity of microtubules in living (or 
fixed) cells at all stages of the cell cycle for cell-based HCS assays. While MAP2 and; 
tau (microtubule associated proteins expressed specifically in neuronal cells) have been 
used to form GFP chimeras (Kaech et aL, (1996) Neuron. 17: 1189-1199; Hall et aL, 
15 (1997), Proc. Nat. Acad. Sci. 94: 4733-4738) their restricted cell type distribution and 
the tendency of these proteins to bundle microtubules when overexpressed make these 
proteins less desirable as molecular reagents for analysis in live cells originating firom 
varied tissues and organs. Moderate overexpressjon of GFP-MAP4 does not disrupt 
microtubule function or integrity (Olson et aL, 1995). Similar constructs can be made 
20 using P-tubulin or a-tubulin via standard techniques in the art. These chimeras will 
provide a means to observe and analyze microtubule activity in Uving cells during all 
stages of the cell cycle. 

In another embodunent, the luminescently labeled protein biosensor of 
microtubule dynamics is expressed, isolated, and added to the cells to be analyzed via 
25 bulk loading techniques, such as microinjection, scrape loading, and impact-mediated 
loading. In this embodiment, there is not an issue of overexpression wifhin the cell, 
and thus a and p tubulin isoforms, MAP4, MAP2 and/or tau can all be used. 

In a further embodiment, the protein biosensor is expressed by the cell, and the 
cell is subsequently contacted with a luminescent label, such as a labeled antibody, that 
30 detects the protein biosensor, endogenous levels of a protem antigen, or both. In this 
embodiment, a luminescent label that detects a and p tubulin isoforms, MAP4, MAP2 
and/or tau, can be used. 
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A variety of GFP mutants are available, all of which would be effective in this 
invention, including, but 'not limited to, GFP mutants which are commercially available 
(Clontech, California). 

Tlie MAP4 construct has been introduced into several mammalian cell lines 
5 (BHK-21, Swiss 3T3, HeLa, HEK 293, LLCPK) and the organization and localization 
of tubulin has been visualized in live cells by virtue of the GFP fluorescence as an 
indicator of MAP4 localization. The construct can be expressed transiently or stable 
cell lines can be prepared by standard methods. Stable HeLa cell Knes expressing the 
EGFP-MAP4 chimera have been obtained, indicatmg that expression of tiie chimera is 
10 not toxic and does not interfere with mitosis. 

Possible selectable markers for establishment and maintenance of stable cell 
lines include, but are not limited to the neomycin resistance gene, hygromycin 
resistance gene, zeocin resistance gene, puromycin resistance gene, bleomycin 
resistance gene, and blastacidin resistance gene. 
15 The utility of this method for the monitoring of microtubule assembly, 

disassembly, and rearrangement has been demonstrated by treatment of transiently and 
stably transfected cells with microtubule drugs such as pacUtaxel, nocodazole, 
vincristine, or vinblastine. 

The present method provides high-content and combined high tiu:oughput-high 
20 content cell-based screens for anti-microtubule drugs, particularly as one parameter m a 
multi-parametric cancer target screen. The EGFP-MAP4 construct used herein can also 
be used as one of the components of a high-content screen that measures multiple 
signahng pathways or physiological events. In a preferred embodiment, a combined 
high throughput and high content screen is employed, wherein multiple cells in each of 
25 the locations containing cells are analyzed in a high throughput mode, and only a subset 
of the locations contaimng cells are analyzed m a high content mode. The high 
throughput screen can be any screen that would be useful to identify those locations 
contaming cells that should be further analyzed, including, but not limited to, 
identifying locations wifli increased luminescence mtensity, those exhibiting 
30 expression of a reporter gene, those undergoing calcium changes, and those 
undergoing pH changes. 
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In addition to drag screening applications, the present invention may be applied 
to clinical diagnostics, the detection of chemical and biological warfare weapons, and 
the basic research market since fundamental cell processes, such as cell division and 
motility, are highly dependent upon microtubule dynamics. 

5 

Image Acquisition and Analysis 

Image data can be obtained from either fixed or living indicator cells. To 
extract morphometric data from each of the images obtained the following method of 
analysis is used: 

10 1. Threshold each nucleus and cytoplasmic image to produce a mask that has value = 
0 for each pixel outside a nucleus or cell boundary. 

2. Overlay the mask on the original image, detect each object in the field (/.e., nucleus, 
or cell), and calculate its size, shape, and integrated intensity. 

3. Overlay the whole cell mask obtained above on the corresponding luminescent 
15 microtubule image and apply one or more of the following set of classifiers to 

determine the micrtotubule morphology and the effect of drugs on microtubule 
morphology. 

Microtubule morphology is defined using a set of classifiers to quantify aspects 
of microtubule shape, size, aggregation state, and polymerization ^ state. These 

2b classifiers can be based on approaches that include co-occurrence matrices, texture 
measurements, spectral methods, structural metiiods, wavelet transforms, statistical 
mefliods, or combinations thereof Examples of such classifiers are as follows: 

l/ A classifier to quantify microtubule length and width using edge 
detection methods such as that discussed in Kolega et al. ((1993). Biolmaging 1:136- 

IS 1 50), which discloses a non-automated method to determine edge strength in individual 
cells), to calculate the total edge strength within each cell. To nonnahze for cell size, 
the total edge strength can be divided by the cell area to give a "microtubule 
morphology** value. Large microtubule morphology values are associated witii strong 
edge strength values and are therefore maximal in cells containing distinct microtubule 

30 structures. Likewise, .small microtubule morphology values are associated witii weak 
edge strength and are minimal in cells with depolymerized microtubules. TTie 
physiological range of microtubule morphology values is set by treating cells with 
either the microtubxile stabilizing drug paclitaxel (10 \lM) or the microtubule 
depolymerizing drug nocodazole (10 p-g/ml). 

35 ... 4. 

2. A classifier to quantify microtubule aggregation mto punctate spots or 

foci using methodology from the receptor intemalization methods discussed supra. 
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3. A classifier to quantify microtubule depolymerization using a measure 
of image texture. 

5 4. A classifier to quantify apparent intercoimectivity, or branching (or 

both), of the microtubules. 

5. Measurement of the kinetics of microtubule reorganization using the 
above classifiers on a time series of images of cells treated! with test compounds. 

10 

In a further aspect, kits are provided for analyzing microtubule stability, 
comprising an expression vector comprismg a nucleic acid that encodes a microtubule 
labeling protein and instructions for using the e?q)ression vector for carrying out the 
methods described above. In a preferred embodiment, tiie expression vector further 

15 comprises a nucleic acid that encodes a luminescent protein, wherein the microtubule 
binding protein and the luminescent protein thereof are expressed as a fusion protein. 
Altematively, the kit may contain an antibody that specifically binds to the 
microtubule-labeliug protein. In a further embodiment, the kit includes cells that 
express the microtubule labeling protein. In a preferred embodiment, the cells are 

20 transfected with the expression vector. In another preferred embodiment, the kits 
further contain a compound that is known to dismpt microtubule structure, including 
but not limited to curacin, nocodazole, vincristine, or vinblastine. In another preferred 
embodiment, the kits further comprise a compound that is known to stabilize 
microtubule structure, including but not limited to taxol (paclitaxel), and 

25 discodermolide. 

In another aspect, the present invention comprises a machine readable storage 
medium comprising a program containing a set of instractions for causing a cell 
screening system to execute the disclosed methods for analyzing microtubule stability, 
wherein the cell screening system comprises an optical system with a stage adapted for 

30 holding a plate containing cells, a digital camera, a means for directing fluorescence or 
luminescence emitted firom the cells to the digital camera, and a computer means for 
receiving and processing the digital data from the digital camera. 
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High'Content screens involving the functional localization ofmacromolecules 

Within this class of high-content screen, the functional localization of 
macromolecules in response to external stimuli is measured within living cells. 

Glycolytic enzyme activity regulation. In a preferred embodiment of a 
5 cellular enzyme activity high-content screen, the activity of key glycolytic regulatory 
enzymes are measured in treated cells. To measure enzyme activity, indicator cells 
containing luminescent labeling reagents are treated with test compounds and the 
activity of the reporters is measured in space and time using cell screening system of 
the present invention. 

10 In one embodiment, the reporter of intracellular enzyme activity is fi-uctose-6- 

phosphate, 2-kmase/fructose-2,6-bisphosphatase (PFK-2), a regulatory enzyme whose 
phosphorylation state indicates intracellular carbohydrate anabolism or catabolism 
(Deprez et al. (1997) J, Biol Chem. 272:17269-17275; Kealer et al. (1996) F'EBS 
Letters 395:225-227; Lee et al. il996\ Biochemist?y 35:6010-6019). The indicator 

15 cells contain luminescent reporters consisting of a fluorescent protein biosensor of 
PFK-2 phosphorylation. The fluorescent protein biosensor is constructed by 
introducing an environmentally sensitive fluorescent dye near to the known 
phosphorylation site of the enzyme (Deprez et al. (1997), supra; Giuliano e; al. (1995), 
supra). The dye can be of the ketocyanine class (Kessler and Wolfbeis (1991), 

20 Spectrochimica Acta 47A:187-192 ) or any class that contains a protein reactive moiety 
and a fluorochrome whose excitation or emission spectrum is sensitive to solution 
polarity. The fluorescent protein biosensor is introduced into the indicator cells using 
bulk loading methodology. 

Living indicator cells are treated with test compounds, at final concentrations 

25 ranging from 10'*^ M to 10"^ M for times ranging from 0.1 s to 10 h. In a preferred 
embodiment, ratio image data are obtained from living treated indicator cells by 
collecting a spectral pair of fluorescence images at each time point To extract 
morphometric data from each time point, a ratio is made between each pair of images 
by numerically dividing the two spectral images at each time point, pixel by pixel. 

30 Each pixel value is then used to calculate the fractional phosphorylation of PFK-2. At 
small fractional values of phosphorylation, PFK-2 stimulates carbohydrate catabolism. 
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At high fractional values of phosphorylation, PFK-2 stimulates carbohydrate 
anabolism. 

Protein kinase A activity and localization of subunits. In another 

5 embodiment of a high-content screen, both the domzun localization and activity of 
protein kinase A (PKA) within indicator cells are measured in response to treatment 
with test compounds. 

The indicator cells contain luminescent reporters including a fluorescent protein 
biosensor of PKA activation. The fluorescent protein biosensor is constructed by 

10 introducing an environmentally sensitive fluorescent dye into the catalytic subunit of 
PKA near the site known to interact with the regulatory subunit of PKA (Harootunian 
et al. (1993), MoL BioL of the CeZM:993-1002; Johnson et aL (1996), Cell 85:149-158; 
Giuliano et al. (1995), supra). The dye can be of the ketocyanine class (Kessler, and 
Wolfljeis (1991), Specirochimica Acta 47A:187-192) or any class that contains a 

15 protein reactive moiety and a fluorochrome -whose excitation or emission spectram is 
sensitive to solution polarity. The fluorescent protein biosensor of PKA activation is 
introduced into the indicator cells using bulk loading methodology. 

In one embodiment, living indicator cells are treated with test compounds, at 
final concentrations ranging from 10"^^ M to 10'^ M for times ranging from 0.1 s to 10 

20 h. In a preferred embodiment, ratio image data are obtained from livmg treated 
indicator cells. To extract biosensor data from each time point, a ratio is made between 
each pair of images, and each pixel value is then used to calculate the fractional 
activation of PKA (e.g., separation of the catalytic and regulatory subunits after cAMP 
binding). At high fractional values of activity, PFK-2 stimulates biochemical cascades 

25 within the living cell. 

To measure the translocation of the catalytic subunit of PKA, indicator cells 
containing luminescent reporters are treated with test compounds and the movement of 
the reporters is measured in space and time using the cell screening system. The 
indicator cells contain liuninescent reporters consisting of domain markers used to 

30 measure the localization of the cytoplasmic and nuclear domains; When the indicator 
cells are treated with a test compounds, the dynamic redistribution of a PKA 
fluorescent protein* biosensor- is recorded intracellularly as a series of images over a 
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time scale ranging fiom 0.1 s to 10 h. Each image is analyzed by a method that 
quantifies the movement of the PKA between the cytoplasmic and nuclear domains. To 
do this calculation, the images of the probes used to mark the cytoplasmic and nuclear 
domains are used to mask the image of the PKA fluorescent protein biosensor. The 

5 mtegrated brightness per unit area under each mask is used to form a translocation 
quotient by dividing the cytoplasmic integrated brightness/area by the nuclear 
integrated brightness/area. By comparing the translocation quotient values firom 
control and experimental wells, the percent translocation is calculated for each potential 
lead compound. The output of the high-content screen relates quantitative data 

10 describing the magnitude of the translocation within a large number of individual cells 
that have been treated with test compound in the concentration range of 10' M to 10" 
M. 

High-content screens involving the induction or inhibition of gene expression 
15 RNA-based fluorescent biosensors 

Cytoskeletal protein transcription and message localization. Regulation of 
the general classes of cell physiological responses including cell-substrate adhesion, 
cell-cell adhesion, signal transduction, cell-cycle events, intermediary and sipialing 
molecule metabolism, cell locomotion, cell-cell communication, and cell death can 
20 involve the alteration of gene expression. High-content screens can also be designed to 
measure this class of physiological response. 

In one emboctiment, tiie reporter of mti^cellular gene expression is an 
oligonucleotide that can hybridize with the target mKNA and alter its fluorescence 
signal. In a preferred embodiment, the oligonucleotide is a molecular beacon (Tyagi 
25 and Kramer (1996) Nat. Biotechnol 14:303-308), a luminescence-based reagent whose 
fluorescence signal is dependent on intermolecular and intiramolecular interactions. 
The fluorescent biosensor is constructed by introducing a fluorescence energy transfer 
pair of fluorescent dyes such that there is one at each end (5' and 3') of the reagent 
The dyes can be of any class that contains a protein reactive moiety and fluorochromes 
30 whose excitation and emission spectra overlap sufficiently to provide fluorescence 
energy transfer between the dyes in the resting state, including, but not limited to, 
fluorescein and rhodamine (Molecular Probes, Inc.). In a preferred embodiment, a 
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portion of the message coding for p-actin (Kislauskis et al, (1994), J, Cell Biol. 
127:441-451; McCann et al. (1997), Proa Natl Acad. ScL 94:5679-5684; Sutoh 
(1982), Biochemistry 21:3654-3661) is inserted into the loop region of a hairpin-shaped 
oligonucleotide with the ends tethered together due to intramolecular hybridization. At 
5 each end of the biosensor a fluorescence donor (fluorescein) and a fluorescence 
acceptor (rhodamine) are covalently bound. In the tethered state, the fluorescence 
energy transfer is maximal and therefore indicative of an imhybridized molecule. 
When hybridized with the mRNA coding for (5-actin, the tether is broken and energy 
transfer is lost. The complete fluorescent biosensor is introduced into the indicator 

10 cells using bulk loading methodology. 

In one embodiment, living indicator cells are treated with test compounds, at 
final concentrations ranging from 10"^^ M to 10"^ M for times ranging from 0.1 s to 10 
h. In a preferred embodiment, ratio image data are obtained from living treated 
indicator cells. To extract morphometric data from each time point, a ratio is made 

15 between each pair of images, and each pixel value is then used to calculate the 
fractional hybridization of the labeled nucleotide. At small fractional values of 
hybridization little* expression of P-actin is indicated. At high fractional values of 
hybridization, maximal expression of p-actin is indicated. Furthermore, the distribution 
of hybridized molecules within the cytoplasrn of the indicator cells is also a measure of 

20 the physiological response of the indicator cells. 

Cell surface binding of a ligand 

Labeled insulin binding to its cell surface receptor in living cells. Cells 
whose plasma membrane domain has been labeled wdth a labeling reagent of a 

25 particular color are incubated with a solution containing insulin molecules (Lee et al. 
(1997), Biochemistry 36:2701-2708; Martinez-Zaguilan et al. (1996), Am. J. Physiol 
270:C1438-C1446) that are labeled with a luminescent probe of a different color for an 
appropriate time under the appropriate conditions. After incubation, unbound insulin 
molecules are washed away, the cells fixed and the distribution and concentration of the 

30 insulin on the plasma membrane is measured. To do this, the cell membrane image is 

used as a mask for the insulin image. The integrated intensity from the masked insulin 

image is compared to a set of images containing known amoimts of labeled insulin. 
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The amount of insulin bound to the cell is determined from the standards and used in 
conjunction with the total concentration of insulin incubated with the cell to calculate a 
dissociation constant or insulin to its cell surface receptor. 

5 Labeling of cellular compartments 
Whole cell labeling 

Whole cell labeling is accomplished by labeling cellular components such that, 
djmamics of cell shape and motility of the cell can be measured over time by analj^zing 
fluorescence images of cells. 

10 In one embodiment, small reactive fluorescent molecules are introduced into 

living cells. These membrane-permeant molecules both diffuse through and react with 
protein components in the plasma membrane. Dye molecules react with intracellular 
molecules to both increase the fluorescence signal emitted from each molecule and to 
entrap the fluorescent dye within living cells. These molecules include reactive 

15 chloromethyl derivatives of aminocoumarins, hydroxycoumarins, eosin diacetate, 
fluorescein diacetate, some Bodipy dye derivatives, and tetramethyhrhodamine. The 
reactivity of these dyes toward macromolecules includes free primary amino groups 
and free sulfliydryl groups. 

In another embodiment, the cell surface is labeled by allowing the cell to 

20 interact with fluorescently labeled antibodies or lectins (Sigma Chemical Company, St. 
Louis, MO) that react specifically with molecules on the cell surface. Cell surface 
protein chimeras expressed by the cell of interest that contain a green fluorescent 
protein, or mutant thereof, component can also be used to fluorescently label the entire 
cell surface. Once the entire cell is labeled, images of the enture cell or cell array can 

25 become a parameter in high content screens, involving the measurement of cell shape, 
motility, size, and growth and division- 



Plasma membrane labeling 

, In one embodiment, labeling the whole plasma membrane employs some of the 
30 same methodology described above for labeling the entire cells. Luminescent 
molecules that label the entire cell surface act to delineate the plasma membrane. 
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In a second embodiinent subdomains of the plasma membrane, the extracellular 
surface, the lipid bilayer, and the intracellular surface can be labeled separately and 
used as components of high content screens. In the &st embodiment, the extracellular 
surface is labeled using a brief treatment with a reactive fluorescent molecule such as 

5 the succinimidyl ester or iodoacetamde derivatives of fluorescent dyes such as the 
fluoresceins, rhodamines, cyanines, and Bodipys. 

In a third embodiment, the extracellular surface is labeled using fluorescently 
labeled macromolecules vsath a high affinity for cell surface molecules. These include 
fluorescently labeled lectins such as the fluorescein, rhodamine, and cyanine 

10 derivatives of lectins derived from jack bean (Con A), red kidney' bean 
(eiythroagglutinin PHA-E), or wheat germ. 

In a fourth embodiment, fluorescently labeled antibodies with a high affinity for 
cell surface components are used to label the extmcellular region of the plasma 
membrane. Extracellular regions of cell surface receptors and ion channels are 

15 examples of proteins that can be labeled with antibodies. 

In a fifth embodiment, the lipid bilayer of the plasma membrane is labeled with 
fluorescent molecules. These molecules include fluorescent dyes attached to long chain 
hydrophobic molecules that interact strongly with the hydrophobic region in the center 
. of the plasma membrane lipid bilayer. Examples of these dyes i;iclude the PKH series 

20 of dyes (U.S. 4,783,401, 4,762701, and 4,859,584; available commercially from Sigma 
Chemical Company, St. Loius, MO), fluorescent phospholipids such as 
nitrobenzoxadiazole glycerophosphoethanolanoune and fluorescein-derivatized 
dihexadecanoylglyeerophosphoetha-nolamine, fluorescent fatty acids such as 5-butyl- 
4,4-difluoro-4-bora-3a,4a-diaza-s-indacene-3-nonanoic acid and 1-pyrenedecanoic acid 

25 (Molecular Probes, Inc.), fluorescent sterols including cholesteryl 4,4-difluoro-5,7- 
dimethyl-4-bora-3a,4a-diaza-s-indacene-3-dodecanoate and cholesteryl 1- 
pyrenehexanoate, and fluorescently labeled proteins that interact specifically with lipid 
bilayer components such as the fluorescein derivative of annexin V (Caltag Antibody 
Co, Burlingame, CA). 

30 In another embodiment, the intracellular component of the plasma membrane is 

labeled with fluorescent molecules. Examples of these molecules are the intracellular 
components of the trimeric G-protein receptor, adenylyl cyclase, and ionic transport 

81 



BNSDOCtD: <WO__O05QB72A3_lA> 



wo 00/50872 PCT/USOO/04794 

proteins. These molecules can be labeled as a result of tight binding to a fluorescently 
labeled specific antibody or by the incorporation of a fluorescent protein chimera that is 
comprised of a membrane-associated protein and the green fluorescent protein, and 
mutants thereof. 

5 

Endosome fluorescence labeling 

In one embodiment, ligands that are transported into cells by receptor-mediated 
endocytosis are used to trace the dynanfiics of endosomal organelles. Examples of 
labeled ligands include Bodipy FL-labeled low density lipoprotein complexes, 
10 tetramethylrhodamine transferrin analogs, and fluorescently labeled epidermal growth 
factor (Molecular Probes, Lie.) 

In a second embodiment, fluorescently labeled primaiy or secondary antibodies 
(Sigma Chemical Co. St. Louis, MO; Molecular Probes, Inc. Eugene, OR; Caltag 
Antibody Co.) that, specifically label endosomal ligands are used to mark the 
15 endosomal compartment in cells. 

In a third embodiment, endosomes are fluorescently labeled in cells expressing 
protein chimeras formed by fusing a green fluorescent protein, or mutants thereof, with 
a receptor whose internalization labels endosomes. Chimeras of the EGF, transferrin, 
and low density lipoprotein receptors are examples of these molecules. 

20 

Lysosome labeling 

In one embodiment, membrane penneant lysosome-specific luminescent 
reagents are used to label the lysosomal compartment of living and fixed cells. These 
reagents include the luminescent molecules neutral red, N-(3-((2,4- 

25 diiiitrophenyl)amino)propyl)-N-(3-aniinopropyl)methylamine, and the LysoTracker 
probes which report intralysosomal pH as well as the dynamic distribution of 
lysqsomes (Molecular Probes, Inc.) 

In a second embodiment, antibodies against lysosomal antigens (Sigma 
Chemical Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to label 

30 lysosomal components that are localized in specific lysosomal domains. Examples of 
these components are the degradative enzymes involved in cholesterol ester hydrolysis, 
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membrane protein proteases, and nucleases as well as the ATP-driven lysosomal proton 
pump. 

In a third embodiment, protein chimeras consisting of a lysosomal protein 
genetically fused to an intrinsically luminescent protein such as the green fluorescent 
5 protein, or mutants thereof, are used to label the lysosomal domain. Examples of these 
components are the degradative enzymes involved in cholesterol ester hydrolysis, 
membrane protein proteases, and nucleases as well as the ATP-driven lysosomal proton 
pump. 

10 Cytoplasmic fluorescence labeling 

In one embodiment, cell permeant fluorescent dyes (Molecular Probes, Inc.) 
with a reactive group are reacted with living cells. Reactive dyes including 
monobromobimane, 5-chloromethylfluorescein diacetate, carboxy fluorescein diacetate 
succinimidyl ester, and chloro.methyl tetramethylrhodamine are examples of cell 

15 permeant fluorescent dyes that are used for long term labeling of the cytoplasm of cells. 

In a second embodiment, polar tracer molecules such as Lucifer yellow and 
cascade blue-based fluorescent dyes (Molecular Probes, Inc.) are introduced into cells 
using bulk loading methods and are also used for cytoplasmic labeling. 

In a third embodiment, antibodies against cytoplasmic components (Sigma 

20 Chemical Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to fluorescently 
label the cytoplasm. Examples of cytoplasmic antigens are many of the enzymes 
involyed in intermediary metabolism, Enolase, phosphofinictokinase, and acetyl-CoA 
dehydrogenase are examples of uniformly distributed cytoplasmic antigens. 

In a fourth embodiment, protein chimeras consisting of a cytoplasmic protein 

25 genetically fused to an intrinsically luminescent protein such as the green fluorescent 
protein, or mutants thereof, are used to label the cytoplasm. Fluorescent chimeras of 
uniformly distributed proteins are used to label the entire cytoplasmic domain. 
Examples of these proteins are many of the proteins involved in intermediary 
metabolism and include enolase, lactate dehydrogenase, and hexokinase, 

30 In a fifth embodiment, antibodies against cytoplasmic antigens (Sigma 

Chemical Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to label 

cytoplasmic components that are localized in specific cytoplasmic sub-domains. 
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Examples of these components are the cytoskeletal proteins actin, tubulin, and 
cytokeratin. A population of these proteins within cells is assembled into discrete 
structures, which in this case, are fibrous. Fluorescence labeling of these proteins with 
antibody-based reagents therefore labels a specific sub-domain of the cytoplasm. 
5 In a sixth embodiment, non-antibody-based fluorescently labeled molecules that 

interact strongly with cytoplasmic proteins are used to label specific cytoplasmic 
components. One example is a fluorescent analog of the enzyme DNAse I (Molecular 
Probes, Inc.) Fluorescent analogs of this enzyme bind tightly and specifically to 
cytoplasmic actin, thus labeling a sub-domain of the cytoplasm. In another example, 
10 fluorescent analogs of the mushroom toxin phalloidin or the dmg paclitaxel (Molecular 
Probes, Inc.) are used to label components of the actin- and microtubule-cytoskeletons, 
respectively. 

In a seventh embodiment, protein chimeras consisting of a. cytoplasmic protein 
genetically fiised to an intrinsically luminescent protein such as the green fluorescent 
15 protein, or mutants thereof, are used to label specific domains of the cytoplasm. 
Fluorescent chimeras of highly localized proteins are used to label cytoplasmic sub- 
domains. Examples of these proteins are many of the proteins involved in regulating 
the cytoskeleton. They include the structural proteins actin, tubulin, and cytokeratin as 
well as the regulatory proteins microtubule associated protein 4 and a-actinin. 

20 

Nuclear labeling 

In one embodiment, membrane permeant nucleic-acid-specific luminescent 
reagents (Molecular Probes, Inc.) are used to label the nucleus of living and fixed cells. 
These reagents include cyanine-based dyes (e.g., TOTO®, YOYO®, and BOBO™), 
25 phenanthidines and acridines (e.^., ethidium bromide, propidium iodide, and acridine 
orange), indoles and imidazoles (e,g,, Hoechst 33258, Hoechst 33342, and 4%6- 
diamidino-2-phenyiindole), and other similar reagents (e.g*., 7-aminoactinomycin D, 
hydroxystilbamidine, and the psoralens). 

In a second embodiment, antibodies against nuclear antigens (Sigma Chemical 
30 Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to label nuclear 
components that are localized in specific nuclear domains. Examples of these 
components are the macromolecules involved in maintaining DNA stracture and 
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function. DNA, RNA, liistones, DNA polymerase, RNA polymerase, lamins, and 
nuclear variants of cytoplasmic proteins such as actin are examples of nuclear antigens. 

In a third embodiment, protein chimeras consisting of a nuclear protein 
genetically fused to an intrinsically luminescent protein such as the green fluorescent 
5 protein, or mutants tiiereof, are used to label the nuclear domain. Examples of these 
proteins are many of the proteins involved in maintaining DNA stmcture and function. 
Histones, DNA polymerase, RNA polymerase, lamins, and nuclear varimts of 
cytoplasmic proteins such as actin are examples of nuclear proteins. 

10 Mitochondrial labeling 

In one embodiment, membrane permeant mitochondrial-specific luminescent 
reagents (Molecular Probes, Inc.) are used to label the mitochondria of living and fixed 
cells. These reagents include rhodamine 123, tetramethyl rosamine, JC-1, and the 
MitoTracker reactive dyes. 

15 In a second embodiment, antibodies agmnst mitochondrial antigens (Sigma 

Chemical Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to label 
mitochondrial components that are localized in specific mitochondrial domains. 
Examples of these components are the macromolecules involved in maintaining 
/ mitochondrial DNA stmcture and function. DNA, RNA, histones, DNA polymerase, 

20 RNA polymerase, and mitochondrial variants of cytoplasmic macromolecules such as 
mitochondrial tRNA and rRNA are examples mitochondrial antigens. Other examples 
of mitochondrial antigens are the components of the oxidative phosphorylation system 
found in the mitochondria (e.g., cytochrome c, cytochrome c oxidase, and succinate 
dehydrogenase). 

25 In a third embodunent, protein chimeras consisting of a mitochondrial protein 

genetically fused to an intrinsically luminescent protein such as the green fluorescent 
protein, or mutants thereof, are used to label the mitochondrial domain. Examples of 
these components are the macromolecules involved in maintaining mitochondrial DNA 
structure and function. Examples include histones, DNA polymerase, RNA 

30 polymerase, and the components of the oxidative phosphorylation system found in the 
mitochondria (e,g., cytochrome c, cytochrome c oxidase, and succinate 
dehydrogenase). 
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Endoplasmic reticulum labeling 

In one embodiment, membrane peimeant endoplasmic reticulum-specific 
luminescent reagents (Molecular Probes, Inc.) are used to label the endoplasmic 
reticulum of living and fixed cells. These reagents include short chain carbocyanine 
5 dyes {e.g., DiOCs and DiOCa), long chain carbocyanine dyes DiICi6 and DilCis), 
and luminescently labeled lectins such as concanavalin A. 

In a second embodiment, antibodies against endoplasmic retictdum antigens 
(Sigma Chemical Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to label 
endoplasmic reticulum components that are localized in.specific endoplasmic reticulima 
10 domains. Examples of these components are the macromolecules involved in tKe fatty 
acid elongation systems, glucose-6-phosphatase, and HMG CoA-reductase. 

In a third embodiment, protein chimeras consisting of a endoplasmic reticulum 
protein genetically fused to an intrinsically luminescent protein such as the green 
fluorescent protein, or mutants thereof, are .used to label the endoplasmic reticulum 
15 domain. Examples of these components are the macromolecules involved in the fatty 
acid elongation systems, glucose-6-phosphatase, and HMG CoA-reductase. 

Golgi labeling 

In one embodiment, membrane permeant Golgi-specific luminescent reagents 
(Molecular Probes, Inc.) are used to label the Gdlgi of living and fixed cells. These 
20 reagents include luminescently labeled macromolecules such as wheat germ agglutinin 
and Brefeldin A as well as luminescently labeled ceramide. 

In a second embodiment, antibodies against Golgi antigens (Sigma Chemical 
Co.; Molecular Probes^ Inc.; Caltag Antibody Co.) are used to label Golgi components 
that are localized in specific Golgi domains. Examples of these components are N- 
25 acetylglucosamine phosphotransferase, Golgi-specific phosphodiesterase, and 
mannose-6-phosphate receptor protein. 

In a third embodiment, protein chimeras consisting of a Golgi protein 
genetically fiised to an intrinsically luminescent protein such as the green fluorescent 
protein, or mutants thereof, are used to label the Golgi domam. Examples of these 
30 components are N-acetylglucosamine phosphotransferase, Golgi-specific 
phosphodiesterase, and mannose-6-phosphate receptor protein. 
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While many of the examples presented involve the measurement of single 
cellular processes, this is again is intended for purposes of illustration only. Multiple 
parameter high-content screens can be produced by combining several single parameter 
screens into a multiparameter high-content screen or by adding cellular parameters to 

5 any existing high-content screen. Furthermore, while each example is described as 
being based on either live or fixed cells, each high-content screen can be designed to be 
used with both live and fixed cells. 

Those skilled in the art will recognize a wide variety of distinct screens that can 
be developed based on the disclosure provided herein. There is a large and growing list 

10 of known biochemical and molecular processes in cells that involve translocations or 
reorganizations of specific components within cells. The signalmg pathway from the 
cell surface to target sites within the cell involves the translocation of plasma 
membrane-associated proteins to the cytoplasm. For example, it is known that one of 
the src family of protein tyrosine kinases, pp60c-srG (Walker et al (1993), J, Biol 

15 Chenu 268:19552-19558) translocates from the plasma membrane to the cytoplasm 
upon stimulation of fibroblasts with platelet-derived growth factor (PDGF). 
Additionally, the targets for screening can themselves be converted into fluorescence- 
based reagents that report molecular changes including ligand-binding and post- 
translocational modifications. '\ ^ 

20 

Example 10. Protease Biosensors 
(1) Background 

As xased herein, the following terms are defined as follows: 

• Reactant - the parent biosensor that interacts with the proteolytic enzyme. 

25 • Product - the signal-containing proteolytic fragment(s) generated by the interaction 
of the reactant with the enzyme. 

• Reactant Target Sequence - an amino acid sequence that imparts a restriction on the 
cellular distribution of the reactant to a particular subcellular domaiu of the cell. 

• Product Target Sequence - an amino acid sequence that imparts a restriction on the 
30 cellular distribution of the signal-containing product(s) of the targeted enzymatic 

reaction to a particular subcellular domain of the cell. If the product is initially 
localized within a membrane bound compartment, then the Product Target 
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Sequence must incorporate the ability to export the product out of the membrane- 
bound compartment. A bi-functional sequence can be used, which first moves the 
product out of the membrane-bound compartment, and then targets the product to 
the final compartment. In general, the same amino acid sequences can act as either 
5 or both reactant target sequences and product target sequences* Exceptions to this 

include amino acid sequences which target the nuclear envelope, Golgi apparatus, 
endoplasmic reticuulum, and which are involved in famesylation, which are more 
suitable as reactant target sequences. 

• Protease Recognition Site - an amino acid sequence that imparts specificity by 
10 mimicking the substrate, providing a specific binding and cleavage site for a 

protease. Although typically a short sequence of amino acids representing the 
minimal cleavage site for a protease (e.g. DEVD for caspase-3, Villa, P., S.H. 
Kaufinarai, and W.C. Eamshaw. 1997. Caspases and caspase inhibitors. Trends 
Biochem Scu 22:3S8-93), greater specificity may be established by using a longer 
15 sequence fi"om an established substrate. 

• Compartment - any cellular sub-structure or macromolecular component of the cell, 
whether it is made of protein, lipid, carbohydrate, or nucleic acid. It could be a 
macromolecular assembly or an organelle (a membrane delimited cellular 
component). Compartments include, but are not limited to, cytoplasm, nucleus, 

20 nucleolus, inner and outer surface of nuclear envelope, cytoskeleton, peroxisome, ^ 

endosome, lysosome, inner leaflet of plasma membrane, outer leaflet of plasma 
membrane, outer leaflet of mitochondrial membrane, inner leaflet of mitochondrial 
membrane, Golgi, endoplasmic reticulum, or extracellular space. 
Signal — an amino acid sequence that can be detected. This includes, but is not 

25 limited to inherently fluorescent proteins (e.g. Green Fluorescent Protein), cofactor- 

requiring fluorescent or luminescent proteins (e.g. phycobiliproteins or luciferases), 
and epitopes recognizable by specific antibodies or other specific natural or 
unnatural binding probes, including but not limited to dyes, enzyme cofactors and 
engineered binding molecules, which are fluorescently or luminescently labeled. 

30 Also included are site-specifically labeled proteins that contain a luminescent dye. 

Methodology for site-specific labeling of proteins includes, but is not limited to, 
engineered dye-reactive amino acids (Post, et aL, J. Biol. Chem. 269:12880-12887 
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(1994)), enzyme-based incorporation of luminescent substrates into proteins 
(Buckler, et al., AnalyL Biochenu 209:20-31 (1993); Takashi, Biochemistry. 
27:938-943 (1988)), and the incorporation of unnatural labeled amino acids into 
proteins (Noren, et al.. Science. 244:182-188 (1989)), 
5 • Detection - a means for recording the presence, position, or amount of the signal. 
The approach may be direct, if the signal is inherently fluorescent, or indirect, if, for 
example, the signal is an epitope that must be subsequently detected with a labeled 
antibody. Modes of detection include, but are not limited to, the spatial position of 
fluorescence, luminescence, or phosphorescence: (1) intensity; (2) polarization; (3) 
10 lifetime; (4) wavelength; (5) energy transfer; and (6) recovery after photobleaching. 

The basic principle of the protease biosensors of the present invention is to 
spatially separate the reactants from the products generated during a proteolytic 
reaction. The separation of products from reactants occurs upon proteolytic cleavage of 
the protease recognition site within the biosensor, allowing the- products to bind to, 
15 diffuse into, or be imported into compartments of the cell different from those of the 
reactant. This spatial separation provides a means of quantitating a proteolytic process 
directly in living or JBxed cells. Some designs of the biosensor provide a means of 
restricting the reactant (uncleaved biosensor) to a particular compartment by a protein 
sequence ("reactant target sequence") that binds to or imports the biosensor into a 
20 compartment of the cell. These compartments include, but are not limited to any 
cellular substructure, macromolecular cellular component, membrane-limited 
organelles, or the extracellular space. Given that the characteristics of the proteolytic 
reaction are related to product concentration divided by the reactant concentration, the 
spatial separation of products and reactants provides a means of uniquely quantitating 
25 products and reactants in single cells, allowmg a more direct measure of proteolytic 
activity. 

The molecular-based biosensors may be introduced into cells via transfection 
and the expressed chimeric proteins analyzed in transient cell populations or stable cell 
lines. They may also be pre-formed, for example by production m a prokaryotic or 
30 eukaryotic expression system, and the purified protein introduced into the cell via a 
number of physical mechanisms including, but not limited to, micro-injection, scrape 
loading, electroporation, signal-sequence mediated loading, etc. 
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Measurement modes may include, but. are not limited to, the ratio or difference 
in fluorescence, luminescence, or phosphorescence: (a) intensity; (b) polarization; or (c) 
lifetime between reactant and product. These latter modes require appropriate 
spectroscopic differences between products and reactants. For example, cleaving a 
reactant containing a limited-mobile signal into a very small translocating component 
and a relatively large non-translocating component may be detected by polarization. 
Alternatively, significantly different emission lifetimes between reactants and products 
allow detection in imaging and non-imaging modes. . 

One example of a family of enzymes for w^hich this biosensor can be 
constmcted to report activity is the caspases. Caspases are a class of proteins that 
catalyze proteolytic cleavage of a wide variety of targets during apoptosis. Following 
initiation of apoptosis, the Class II "downstream" caspases are activated and are the 
point of no return in the pathway leading to cell death, resulting in cleavage of 
downstream target proteins. In specific examples, the biosensors described here were 
engineered to use nuclear translocation of cleaved GFP as a measurable indicator of 
caspase activation. Additionally, the use of specific recognition sequences that 
incorporate surrounding amino acids involved in secondary stmcture formation in 
naturally occurring proteins may increase the specificity and sensitivity of this class of 
biosensor. 

Another example of a protease class for which this biosensor can be constmcted 
to report activity is zinc metalloproteases. Two specific examples of this class are the 
biological toxins derived firom Clostridial species (C. botulinum and C tetani) and 
Bacillus anthracis. CHerreros et al. In The Comprehensive Sourcebook of Bacterial 
Protein Toxins. J.E. Alouf and J-H. Freer, Eds. 2""^ edition, San Diego, Academic Press, 
1999; pp 202-228.) These bacteria express and secrete zinc metalloproteases that enter 
eukaryotic cells and specifically cleave distinct target proteins. For example, the 
anthrax protease fi-om Bacillus anthracis is delivered into the cytoplasm of target cells 
via an accessory pore-forming protein, where its proteolytic activity inactivates the 
MAP-kinase signaling cascade through cleavage of mitogen activated protein kinase 
kinases 1 or 2 (MEKl or MEK2). (Leppla, S.A In The Comprehensive Sourcebook of 
Bacterial Protein Toxins. J.E. Alouf and J.H. Freer, Eds. 2""* edition, San Diego, 
Academic Press, 1999; pp243-263.) The toxin biosensors described here take 
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advantage of the natural subcellular localization of these and other target proteins to 
achieve reactant targeting. Upon cleavage, the signal (with or without a product target 
sequence) is separated from the reactant to create a high-content biosensor. 

One of skill in the art will recognize that the protein biosensors of this aspect of 
5 the invention can be adapted to report the activity of any member of the caspase family 
of proteases, as well as any other protease, by a substitution of the appropriate protease 
recognition site in any of the constmcts (see Figure 29B). These biosensors can be 
used in high-content screens to detect in vivo activation of enzymatic activity and to 
identify specific activity based on cleavage of a known recognition motif. This screen 
10 can be used for both live cell and fixed end-point assays, and can be combined with 
additional measurements to provide a multi-parameter assay. 

Thus, in another aspect the present invention provides recombinant nucleic acids 
encoding a protease biosensor, comprising: 

a. a first nucleic acid sequence that encodes at least one .detectable 
15 polypeptide signal; 

b. a second nucleic acid sequence that encodes at least one protease 
recognition site, wherein the second nucleic acid sequence is operatively linked to the 
first nucleic acid sequence that encodes the at least one detectable polypeptide signal; 
and 

20 c. a third nucleic acid sequence that encodes at least one reactant target 

sequence, wherein the third nucleic acid sequence is operatively linked to the second 
nucleic acid sequence that encodes the at least one protease recognition site. 

In this aspect, the first and third nucleic acid sequences are separated by the 
25 second nucleic acid sequence, which encodes the protease recognition site. 

In a fiirfher embodiment, the recombinant nucleic acid encoding a protease 
biosensor comprises a fourth nucleic acid sequence that encodes at least one product 
target sequence, wherein the fourth nucleic acid sequence is operatively linked to the 
first nucleic acid sequence that encodes the at least one detectable polypeptide signal. 
30 In a further embodiment, the recombinant nucleic acid encoding a protease 

biosensor comprises a fifth nucleic acid sequence that encodes at least one detectable 
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polypeptide signal, wherein the fifth nucleic acid sequence is operatively linked to the 
third nucleic acid sequence that encodes the reactant target sequence. 

In a preferred embodiment, the detectable polypeptide signal is selected firom 
the group consisting of fluorescent proteins, luminescent proteins, and sequence 

5 epitopes. In a most preferred embodiment, the first nucleic acid encoding a polypeptide 
sequence comprises a sequence selected from the group consisting of SEQ ID NOS: 35, 
37, 39, 41, 43, 45, 47, 49, and 5L 

In another preferred embodiment, the second nucleic acid encoding a protease 
recognition site comprises a sequence selected fi-om the group consisting of SEQ ID 

10 NOS: 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 
95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, and 121. In another 
preferred embodiment, the third nucleic acid encoding a reactant target sequence 
coTnprises a sequence selected from the group consisting of SEQ ID NOS: 123, 125, 
127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, and 151. 

15 In a most preferred embodiment, the recombinant nucleic acid encoding a 

protease biosensor comprises a sequence substantially similar to sequences selected 
from the group consistmg of SEQ ID N0S:1, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 
27,29,31, and 33. 

In another aspect, the present invention provides a recombinant expression 
20 vector comprising nucleic acid control sequences operatively linked to the above- 
described recombinant nucleic acids. In a still further aspect, the present invention 
provides genetically engineered host cells that have been transfected with the 
recombinant expression vectors of the inventioiL 

In another aspect, the present invention provides recombinant protease 
25 biosensors comprising 

a. a first domain comprising at least one detectable polypeptide 

signal; 

b, a second domain comprising at least one protease recognition 

site; and 

30 c. ■ a third domain comprising at least one reactant target sequence; 

wherein the first domain and the third donriain are separated by the 
second domain. 
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Inherent in this embodiment is the concept that the reactant target sequence 
restricts the cellular distribution of the reactant, with redistribution of the product 
occurring after activation (ie: protease cleavage). This redistribution does not require a 
complete sequestration of products and reactants, as the product distribution can 
5 partially overlap the reactant distribution in the absence of a product targeting signal 
(see below^). 

In a preferred embodiment, the recombinant protease biosensor fiirther 
comprises a fourth domain comprising at least one product target sequence, wherein the 
fourth domain and the first domain are operatively linked and are separated from the 

10 third domain by the second domain. In another embodiment, the recombinant protease 
biosensor further comprises a fifth domain comprising at least one detectable 
polypeptide signal, wherein the fifth domain and the third domain are operatively 
linked .and are separated from the first domain by the second domain. 

In a preferred embodiment, the detectable polypeptide signal domain (first or 

15 fifth domain) is selected from the group consisting of fluorescent proteins, lumuiescent 
proteins, and sequence epitopes. In a most prefiared embodiment, the detectable 
polypeptide signal domain comprises a sequence selected from the group consisting of 
SEQ ID NOS:36, 38, 40, 42, 44, 46, 48, 50, and 52. 

In another preferred embodiment, the second domain comprising a protease 

20 recognition site comprises a sequence selected from the group consisting of SEQ ID 
NOS:54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 
98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, and 122. In another 
preferred embodiment, the reactant and/or target sequence domains comprise a 
sequence selected from the group consisting of SEQ ID NOS:124, 126, 128, 130, 132, 

25 134, 136, 138, 140, 142, 144, 146, 148, 150, and 152.. 

In a most preferred embodiment, the recombinant protease biosensor comprises 
a sequence substantially similar to sequences selected from the group consisting of 
SEQ ID NO:2, 4, 6; 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, and 34. 

In a still fiirther embodiment, the present invention provides methods and kits 

30 for automated analysis of cells, comprising using cells that possess the protease 
biosensors of the invention to identify compounds that affect protease activity. The 
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method can be combined with the other methods of the invention in a variety of 
possible multi-parametric assays. 

In these various embodiments, the basic protease biosensor is composed of 
multiple domains, including at least a first detectable polypeptide signal domain, at 
5 least one reactant target domain, and at least one protease recognition domain, wherein 
the detectable signal domain and the reactant target domain are separated by the 
protease recognition domain. Thus, the exact order of the domains in the molecule is 
not generally critical, so long as the protease recognition domain separates the reactant 
target and first detectable signal domain. For each domain, one or more one of the 

10 specified recognition sequences is present. 

In some cases, the order of the domains in the biosensor may be critical for 
appropriate targeting of product(s) and/or reactant to the appropriate cellular 
compartmentCs). For example, the targeting of products or reactants to the peroxisome 
requires that the peroxisomal targeting domain comprise the last three amino acids of 

15 the protein. Determination of those biosmsor in which the relative placement of 
targeting domains within the biosensor is critical can be determined by one of skill in 
the art through routine e^qjerimentation. 

Some examples of the basic organization of domains within the protease 
biosensor are shown in Figure 30. One of skill in the art will recognize that any one of 

20 a wide, variety of protease recognition sites, product target sequences, polypeptide 
signals, and/or product target sequences can be used in various combinations in the 
protein biosensor of the present invention, by substituting the appropriate coding 
sequences into the multi-domain construct. Non-limiting examples of such alternative 
sequences are shown in Figure 29A-29C. Similarly, one of skill in the art will 

25 recognize that modifications, substitutions, and deletions can be made to the coding 
sequences and the amino acid sequence of each individual domain within the biosensor, 
while retaining the function of the domain. Such various combinations of domains and 
modifications, substitutions and deletions to individual domains are within the scope of 
the invention. 

30 As used herein, the tenn "coding sequence" or a sequence which "encodes" a 

particular polypeptide sequence, refers to a nucleic acid sequence which is transcribed 
(in the case of DNA) and translated (in the case of mRNA) into a polypeptide in vitro 
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or in vivo when placed under the control of appropriate regulatory sequences. The 
boundaries of the coding sequence are determined by a start codon at the 5' (amino) 
terminus and a translation stop codon at the 3' (carboxy) terminus. A coding sequence 
can include, but is not Umited to, cDNA from prokaryotic or eukaiyotic mKNA, 
5 genomic DNA sequences from prokaryotic or eukaiyotic DNA, and synthetic DNA 
sequences. A transcription termination sequence will usually be located 3' to the coding 
sequence. 

As used herein, the term DNA "control sequmces" refers collectively to 
promoter sequences, ribosome binding sites, polyadenylation signals, transcription 
10 termination sequences, upstream regulatory domains, enhancers, and the like, which 
collectively provide for the transcription and translation of a coding sequence in a host 
cell. Not all of these control sequences need always be present in a recombinant vector 
so long as the DNA sequence of interest is capable of being transcribed and translated 
appropriately. 

15 As used herein, the term "operatively linked" refers to an arrangement of 

elements wherein the components so described are configured so as to perform their 
usual function. Thus, control sequences operatively Imked to a codmg sequence are 
capable of effecting the expression of the codmg sequence. The control sequences need 
not be contiguous with the coding sequence, so long as they ftmction to direct the 

20 expression thereof. Thus, for example, intervening untranslated yet transcribed 
sequences can be present between a promoter sequence and the coding sequence and 
the promoter sequence can still be considered "operatively linked" to the coding 
sequence. 

Furthermore, a nucleic acid coding sequence is operatively linked to another 
25 nucleic acid coding sequences when the coding region for both nucleic acid molecules 
are capable of expression in the same reading frame. The nucleic acid sequences need 
not be contiguous, so long as they are capable of expression in the same reading frame. 
Thus, for example, intervening coding regions can be present between the specified 
nucleic acid coding sequences, and the specified nucleic acid coding regions can still be 
30 considered "operatively linked". 

The intervening coding sequences between the various domains of the 
biosensors can be of any length so long as the function of each domain is retained. 

95 



BNSOOCID: cWO 0050B72A3_IA> 



wo 00/50872 



PCT/USOO/04794 



Generally, this requires that the two-dimensional and three-dimensional structure of the 
intervening protein sequence does not preclude the binding or interaction requirements 
of the domains of the biosensor, such as product or reactant targeting, binding of the 
protease of interest to the biosensor, fluorescence or luminescence of the detectable 
polypeptide signal, or binding of fluorescently labeled epitope-specific antibodies. 

One case where the distance between domains of the protease biosensor is 
important is where the goal is to create a fluorescence resonance energy transfer pair. In 
this case, the FRET signal will only exist if the distance between the donor and 
acceptor is sufficiently small as to allow energy transfer (Tsien, Heim and Cubbit, WO 
97/28261). The average distance between the donor and acceptor moieties should be 
between 1 nm and* 10 nm with a preference of between 1 mn and 6 nm. This is the 
physical distance between donor and acceptor. The intervening sequence length can 
vary considerably since the three dimensional structure of the peptide will determine 
the physical distance between donor and acceptor. 

"Recombinant expression vector" includes vectors that operatively link a 
nucleic acid coding region or gene to any promoter capable of effecting expression of 
the gene product. The promoter sequence used to drive expression of the protease 
biosensor may be constitutive (driven by any of a variety of promoters, including but 
not limited to, CMV, SV40, RSV, actin, EF) or inducible (driven by any of a number of 
inducible promoters including, but not limited to, tetracycline, ecdysone, steroid- 
responsive). The expression vector must be replicable in the host organisms either as 
an episome or by integration into host chromosomal DNA. In a preferred embodiment, 
the expression vector comprises a plasmid. However, the invention is intended to 
include any other suitable expression vectors, such as viral vectors. 

The phrase "substantially similar " is used herein in reference to the nucleotide 
sequence of DNA, or the amino acid sequence of protein, having one or more 
conservative or non-conservative variations from the protease biosensor seqiiences 
disclosed herein, including but not limited to deletions, additions, or substitutions 
wherein the resulting nucleic acid and/or amino acid sequence is functionally 
equivalent to the sequences disclosed and claimed herein. Functionally equivalent 
sequences will function in substantially the same maimer to produce substantially the 
same protease biosensor as the nucleic acid and amino acid compositions disclosed and 
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claimed herein. For example, functionally equivalent DNAs encode protease 
biosensors that are tlie same as those disclosed herein or that have one or more 
conservative amino acid variations, such as substitutions of non-polar residues for other 
• non-polar residues or charged residues for similarly charged residues, or addition 
5 to/deletion from regions of the protease biosensor not critical for functionality. These 
changes include those recognized by those of skill in the art as substitutions, deletions, 
and/or additions that do not substantially alter the tertiary structure of the protein. 

As used herein, substantially similar sequences of nucleotides or amino acids 
share at least about- 70%-75% identity, more preferably 80-85% identity, and most 
10 preferably 90-95% identity. It is recognized, however, that proteins (and DNA or 
mRNA encoding such protems) containing less than the above-described level of 
homology (due to the degeneracy of the genetic code) or that are modified by 
conservative amino acid substitutions (or substitution of degenerate codons) are 
contemplated to be within the scope of the present invention. 
15 The terai "heterologous" as it relates to nucleic acid sequences such as coding 

sequences and control sequences, denotes sequences that are not normally associated 
with a region of a recombinant construct, and/or are not normally associated with a 
particular cell. Thus, a "heterologous" region of a nucleic acid construct is an 
identifiable segment of nucleic acid within or attached to another nucleic acid molecule 
20 that is not found in association with the other molecule in nature. For example, a 
heterologous region of a construct could mclude a codhig sequence flanked by 
sequences not found in association with the coding sequence in nature. Another 
example of a heterologous codmg sequence is a construct where the coding sequence 
itself is not found in nature (e.g., synthetic sequences havuxg codons different &om the 
25 native gene). Similarly, a host cell transformed with a construct which is not normally 
present in the host cell would be considered heterologous for purposes of this invention. 

Within this appUcation, unless otherwise stated, the techniques utilized may be 
found in any of several well-known references such as; Molecular Cloning: A 
Laboratory Manual (Sambrook, et al., 1989, Cold Spring Harbor Laboratory Press), 
30 Gene Expression Technology OVTethods in Enzymology, Vol. 185, edited by D. 
Goeddel, 1991. Academic Press, San Diego, CA), "Guide to Protem Purification" in 
Methods in Enzymology (M.P. Deutshcer, ed., (1990) Academic Press, Inc.); PCR 

97 



BNSDOCID: -eWO 0050872A3_tA> 



wo 00/50872 



PCT/USOO/04794 



Protocols: A Guide to Methods and Applications (Innis, et aL 1990. Academic Press, 
San Diego, CA), Culture of Animal Cells: A Manual of Basic Technique, 2'"' Ed. (R.L 
Freshney. 1987. Liss, Inc. New York, NY), Gene Transfer and Expression Protocols, 
pp. 109-128, ed. EJ. Murray, The Humana Press Inc., Clifton, N.J.), and the Ambion 

5 1998 Catalog (Ambion, Austin, TX). 

The biosensors of the present invention are constructed and used to transfect 
host cells using standard techniques in the molecular biological arts. Any nmnber of 
such techniques, all of which are within the scope of this invention, can be used to 
generate protease biosensor-encoding DNA constructs and genetically transfected host 

10 cells expressing the biosensors. The non-limiting examples that follow demonstrate 
one such technique for constructing the biosensors of the invention, 

EXAMPLE OF PROTEASE BIOSENSOR CONSTRUCTION AND USE: 

In the following examples, caspase-specific biosensors with specific product 

15 target sequences have been constructed using sets of 4 primers (2 sense and 2 
antisense). These primers have overlap regions at their temiini, and are used for PCR 
via a primer walking technique. (Sambrook, J., Fritsch, E.F. and Maniatis, T. (1989 ) 
Molecular Clonmg: A Laboratory Manual. Cold Spring Haibor Laboratory Press, Cold 
Spring Harbor, New York) The two sense primers were chosen to start firom the 5* 

20 polylinker (Bspl) of the GFP-containing vector (Clontech, California) to the middle of 
the designed biosensor sequence. The two antisense primers start from a 3' GFP vector 
site (Bam HI), and overlap with the sense primers by 12 nucleotides in the middle. 

PCR conditions were as follows: 94'^C for 30 seconds for denaturation, 55°C for 
30 seconds for annealing, and 72°C for 30 seconds for extension for 15 cycles. The 

25 primers have restriction endonuclease sites at both ends, faciUtating subsequent cloning 
of the resulting PCR product. 

The resulting PCR product was gel purified, cleaved at BspEl and BamHl 
restriction sites present in the primers, and the resulting firagment was gel purified. 
Similarly, the GFP vector (Clontech, San Francisco, CA) was digested at BspEl and 

30 BamHl sites in the polylinker. Ligation of the GFP vector and the PCR product was 
performed using standard techniques at 16°C overnight. E. coli cells were transfected 
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with the ligation mixtures using standard techniques. Transformed cells were selected 
on LB-agar with an appropriate antibiotic. 

Cells and transf actions. For DNA transfection, BHK cells and MCF-7 cells 
5 were cultured to 50-70% confluence in 6 well plates containing 3 ml of minimal 
Eagle's medium (MEM) with 10% fetal calf serum, 1 mM L-glutamine, 50 jag/ml 
streptomycin, 50 fig/ml penicillin, 0.1 mM non-essential amino acids, 1 mM sodixun 
pyravate and 10 pg/ml of bovuie insulin (for MCF-? cell only) at 37 in a 5% CO2 
incubator for about 36 hours. The cells were washed with serum free MEM media and 
10 incubated for 5 hours with 1 ml of transfection mixture containing 1 }ig of the 
appropriate plasmid and 4 fxg of lipofectimine (BRL) in the serum free MEM media. 
Subsequently, the transfection medium was removed and replaced with 3 ml of noraaal 
culture media. The transfected cells were maintained in growth medium for at least 16 
hours before performing selection of the stable cells based on standard molecular 
15 biology methods (Ausubel, et al 1995). 

Apoptosis assay. For apoptosis assays, the cells (BHK, MCF-7) stably 
transfected with the appropriate protease biosensor expression vector were plated on 
tissue culture treated 96-well plates at 50-60% confluence and cultured overnight at 
20 37°C, 5% CO2. Varying concentrations of cis-platin, staiuosporine, or paclitaxel in 
normal culture media were freshly prepared from stock and added to cell culture dishes 
to replace the old culture media. The cells were then observed with the cell screening 
system of the present invention at the indicated time points either as live cell 
experiments or as fixed end-point experiments. 

25 

1. Construction of 3-domain protease biosensors 

a. Caspase-3 biosensor with an annexin n reactant targeting domain 
(pljkGFP). 

The design of this biosensor is outlined ui Figure 31, and its sequence is shown 
30 in SEQ ID NO:l and 2. 
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Primers for Caspase 3, Product target sequence = none (CP3GFP-CYTO): 

1 ) TC A TC A TCC GGA GCT GGA GCC GGA GCT GGC CGA TCG OCT GTT 
AAA TCT GAA GGA AAG AGA AAG TGT GAC GAA GTT GAT GGA ATT 

5 GAT GAA GTA GCA (SEQ ID NO:153) 

2) GAA GAA GGA TCC GGC ACT TGG GGG TGT AGA ATG AAC ACC 

CTC CAA GCT GAG CTT GCA GAG GAT TTC GTG GAC AGT AGA 
CAT AGT ACT TGC TAC TTC ATC (SEQ ID NO:154) 

3) TCA TCA TCC GGA GCT GGA (SEQ ID NO:155) 
10 4) GAA GAA GGA TCC GGC ACT (SEQ ID NO:156) 

This biosensor is restricted to the cytoplasm by the reactant target sequence. 
The reactant target sequence is the annexin II cytoskeletal binding domain 
(MSTVHEILCKLSLEGVHSTPPSA) (SEQ ID NO:124) (Figure 29C) (Eberhard et 

15 al. 1997. MoL Biol Cell 8:293a). The enzyme recognition site corresponds to two 
copies of the amino acid sequence DEVD (SEQ ID NO:60) (Figure 29B), which 
serves as the recognition site of caspase-3. Other examples with different nxunbers of 
protease recognition sites and/or additional amino acids from a naturally occurring 
protease recognition site are shown below. The signal domain is EGFP (SEQ ID 

20 NO:46) (Figure 29A) (Clontech, California). The parent biosensor (the reactant) is 
restricted to the cytoplasm by binding of the annexin n domain to the cytoskeleton, and 
is therefore excluded from the nucleus. Upon cleavage of the protease recognition site 
by caspase 3, the signal domain (EGFP) is released from the reactant targeting domain 
(annexin II), and is distributed throughout the whole volume of the cell, because it lacks 

25 any specific targeting sequence and is small enough to enter the nucleus passively. 
(Fig 32) 

The biosensor response is measured by quantitating the effective cytoplasm-to- 
nuclear translocation of the signal (see above). Measurement of the response is by one 
of several modes, including integrated or average nuclear region intensity, the ratio or 
30 difference of the integrated or average cytoplasm intensity to integrated or average 
nuclear intensity. The nucleus is defined using a DNA-specific dye, such as Hoechst 
33342. 
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This biosensor provides a measure of the proteoljrtic activity around the annexin 
II cytoskeleton binding sites within the cell. Given the dispersed nature of the 
cytoskeleton and the effectively diffixse state of cytosolic enzymes, this provides an 
effective measure of the cytoplasm in general. 

5 

Results & Discussion: 

Fig 32 illustrates images before and after stimulation of apoptosis by cis-platin 
in BHK cells, transfected with the caspase 3 biosensor. The images clearly illustrate 
accumulation of fluorescence in the nucleus. Generation of the spatial change in 

10 fluorescence is non-reversible and thus the timing of the assay is flexible. Controls for 
this biosensor include using a version in which the caspase-3-specific site has been 
omitted. In addition, disruption of the cytoskeleton with subsequent cell rounding did 
not produce the change in fluorescence distribution. Our experiments demonstrate the 
correlation of nuclear condensation with activation of caspase activity. We have also 

15 tested this biosensor in MCF-7 cells. A recent report measured a peak response in 
caspase-3 activity 6 h after stimulation of MCF-7 cells with etoposide accompanied by 
cleavage of PARP (Benjamin et al. 199iMoI Pharmacol. 53:446-50). However, 
another recent report found that MCF-7 cells do not possess caspase-3 activity and, in 
fact, the caspase-3 gene is functionally deleted (Janicke et aL 1998. J Biol Chem. 

20 273:9357-60). Caspase-3 activity was not detected with the caspase biosensor in MCF- 
7 cells after a 15 h treatment with 100 etoposide. 

Janicke et al., (1998) also indicated that many of the conventional substrates of 
caspase-3 were cleaved in MCF-7 cells upon treatment with staurosporine. Our 
experiments demonstrate that caspase activity can be measured using the biosensor in 

25 MCF-7 cells when treated with staurosporine. The maximum magnitude of the 
activation by staurosporine was approximately one-half that demonstrated with cis- 
platin in BHK cells. This also implies that the current biosensor, although designed to 
be caspase-3-:specific, is indeed specific for a class of caspases rather than uniquely 
specific for caspase-3. The most likely candidate is caspase-7 (Janicke et al., 1998). 

30 These experiments also demonstrated that the biosensor can be used in multiparameter 
experiments, with the correlation of decreases in mitochondrial membrane potential, 
nuclear condensation, and caspase activation. 
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We have specifically tested the effects of pacUtaxel on caspase activation using 
the biosensor. Caspase activity in BHK and MCF-7 cells was stimulated by paclitaxel. 
It also appears that caspase activation occurred after nuclear morphology changes. One 
caveat is that, based on the above discussions, the caspase activity reported by ttie 
5 biosensor in this assay is likely to be due to the combination of caspase-3 and, at least, 
caspase-7 activity. 

Consistent with the above results using staurosporine stimulation on MCF-7 
cells, paclitaxel also stimidated the activation of caspase activity. The magnitude was 
similar to that of staurosporine. This experiment used a much narrower range of 
10 paclitaxel than previous experiments where niiclear condensation appears to dominate 
the response. 

b. Caspase biosensor with the microtubule associated protein 4 
(MAP4) projection domain (CP8GFPNLS-SIZEPROJ) 

15 Another approach for restricting the reactant to the cytoplasm is to make the 

biosensor too large to penetrate the nuclear pores Cleavage of such a biosensor 
liberates a product capable of diffusing into the nucleus. 

The additional size required for this biosensor is provided by using the 
projection domain of MAP4 (SEQ ID Nb:142) (Figure 29C) (CP8GFPNLS- 

20 SIZEPROJ). The projection domain of MAP4 does not interact with microtubules on 
its own, and, when expressed, is diffusely distributed throughout the cytoplasm, but is 
excluded from the nucleus due to its size (-120 kD). Thus, this biosensor is distinct 
from the one using the frill length MAP4 sequence, (see below) One of skill in the art 
will recognize that many other such domains could be substituted for the MAP4 

25 projection domain, including but not limited to multiple copies of any GFP or one or 
more copies of any other protein that lacks an active NLS and exceeds the maximum 
size for diffusion into the nucleus (approximately 60 kD; Alberts, B., Bray, D., Rafi^ 
M., Roberts, K., Watson, J.D. (Eds.) Molecular Biolo^ of the Cell, third edition. New. 
York: Garland publishing, 1994. pp 561-563). The complete sequence of the resulting 

30 biosensor is shown in SEQ ID NO: 3-4. A sinailar biosensor with a different protease 
recognition domain is shown in SEQ ID NO:5-6. 

102 



wo 00/50872 



PCT/USOO/04794 



c, Caspase biosensor with a nuclear export signal 
Another approach for restricting the reactant to the cytoplasm is to actively 
restrict the reactant from the nucleus by using a nuclear export signal. Cleavage of 
such a biosensor liberates a product capable of diffusing into the nucleus, 
5 The Bacillus anthracis bacterium expresses a zinc metalloprotease protein 

complex called anthrax protease. Human mitogen activated protein kinase kinase 1 
(MEK 1) (Seger et al., J. Biol. Chem. 267:25628-25631, 1992) possesses an anthrax 
protease recognition site (amino acids 1-13) (SEQ ID NO:102) (Figure 29B) that is 
cleaved after amino acid 8, as well as a nuclear export signal at amino acids 32-44 
10 (SEQ ID NO:140) (Figure 29Q. Human MEK 2 (Zheng and Guan, J, Biol, Chem. 
268:11435-11439, -1993) possesses an anthrax protease recognition site comprising 
amino acid residues 1-16 (SEQ ID NO:104) (Figure 29B) and anuclear export signal 
at amino acids 36-48. (SEQ ID NO:148) (Figure 29C). 

The anthrax protease biosensor comprises Fret25 (SEQ ID NO:48) (Figure 
15 29A) as the signal, the anthrax protease recognition site, and the nuclear export signal 
from MEK 1 or MEK2. (SEQ ID NOS: 7-8 (MEKl); 9-10 (MEK2)) The intact 
biosensor will be retained in the cytoplasm by virture of this nuclear export signal (eg., 
the reactant target site). Upon cleavage of the fusion protein by anthrax protease, the 
NES will be separated from the GFP allowing the GFt to diffuse mto the nucleus, 

20 

2- Construction of 4- and 5-domaiu biosensors 

For all of the examples presented above for 3-domain protease biosensors, a 
product targeting sequence, including but not limited to those in Figure 29C, such as a 
nuclear localization sequence (NLS), can be operatively linked to the signal sequence, 
25 and thus cause the signal sequence to segregate from the reactant target domain after 
proteolytic cleavage. Addition of a second detectable signal domain, including but not 
limited to those in Figure 29 A, operatively linked with the reactant target domain is 
also useful in allowing measurement of the reaction by multiple means. Specific 
examples of such biosensors are presented below. 

30 

a. 4 domain biosensors 

1. Caspase biosensors with nuclear localization sequences 
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(pcasSnlsGFP; CP3GFPNLS-CYTO): 

The design of the biosensor is outlined in Figure 33, and its sequence is shown 
in SEQ ID NO:ll-12. PGR and cloning procedures were performed as described 
above, except that the following oligonucleotides were used: 
5 Primers for Caspase 3, Product target sequence = NLS (CP3GFPNLS-CYTO) : 

1) TCATCATCC GGAAGAAGGAAACGACAAAAG CGATCGGCT 
GTT AAA TCT GAA GGA AAG AGA AAG TGT GAG GAA GTT GAT GGA 
ATT GAT GAA GTA GCA (SEQ ID NO:157) 
10 2) GAA GAA GGA TCC GGC ACT TGG GGG TGT AGA ATG AAC ACC 
CTC CAA GCT GAG GTT GCA CAG GAT TTC GTG GAG AGT AGA 
CAT AGT ACT TGC TAG TTC ATC (SEQ ID NO: 154) 

3 ) TCA TCA TCC GGA AGA AGG (SEQ ID NO:lS8) 

4) GAA GAA GGA TCC GGC ACT (SEQ ID NO:156) 

15 

This biosensor is similar to that shown in SEQ ED NO:2 except upon 
recognition and cleavage of the protease recognition site, the product is released and the 
signal accumulates specifically in the iiucleus due to the presence of a nuclear 
localization sequence^ RRKRQK (SEQ ID NO;128) (Figure 29C)(Briggs et aL, J. 

20 Biol. Chem. 273:22745, 1998) attached to the signal. A specific benefit of this 
construct is that the products are clearly separated fi-om the reactants. The reactants 
remain in the cytoplasm, while the product of the enzymatic reaction is restricted to the 
nuclear compartment. The response is measured by quantitating the effective 
cytoplasm-to-nuclear translocation of the signal, as described above. 

25 With the presence of both product and reactant targeting sequences in the parent 

biosensor, the reactant target sequence should be dominant prior to activation (e.g., 
protease cleavage) of the biosensor. One way to accomplish this is by masking the 
product targeting sequence in the parent biosensor imtil after protease cleavage. In one 
such example, the product target sequence is functional only when relatively near the 

30 end of a polypeptide chain (ie: after protease cleavage). Alternatively, the biosensor 
may be designed so that its tertiary structure masks the function of the target sequence 
until after protease cleavage. Both of these approaches include comparing targeting 
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sequKices with different relative strengths for targeting. Using the example of the 
nuclear localization sequence (NLS) and annexin H sequences, different strengths of 
NLS have been tried with clone selection based on cytoplasmic restriction of the parent 
biosensor. Upon activation, the product targeting sequence will naturally dominate the 

5 localization of its associated detectable sequence domain because it is then separated 
fiom the reactant targeting sequence. 

An added benefit of using this biosensor is that the product is targeted, and thus 
concentrated, into a smaller region of the cell. Thus, smaller amounts of product are 
detectable due to the increased concentration of the product. This concentration effect 

10 is relatively insensitive to the cellular concentration of the reactant. The signal-to-noise 
ratio (SNR) of such a measurement is improved over the more dispersed distribution of 
biosensor #1. 

Similar biosensors that incorporate either the caspase 6. (SEQ ID NO:66) 
(Figure 29B) or the caspase 8 protease recognition sequence (SEQ ID NO:74) (Figure 
15 29B) can be made using the methods described above, but using the following primer 
sets: 

Primers for Caspase 6, Product target sequence = NLS (CP6GFPNLS- 
CYTO) 

1) TCA TCA TCC GGA AGA AGG AAA CGA CAA AAG CG^ TCG 
20 ACA AGA CTT GTT GAA ATT GAG AAC (SEQ ID NO:159) 

2) GAA GAA GGA TCC GGC ACT TGG GGG TGT AGA ATG AAC 
ACC CTC CAA GCT GAG CTT GCA CAG GAT TTC GTG GAC 
AGT AGA CAT AGT ACT GTT GTC AAT TTC (SEQ ID NO:160) 

25 3) TCA TCA TCC GGA AGA AGG (SEQ ID NO:158) 
4) GAA GAA GGA TCC GGC ACT (SEQ ID NO:156) 

Primers for Caspase 8, Product target sequence = NLS (CPSGFPNLS-CYTO) 

1) TCA TCA TCC GGA AGA AGG AAA CGA CAA AAG CGA TCG 

30 TAT CAA AAA GGA ATA CCA GTT GAA ACA GAC AGC GAA GAG 
CAA CCT TAT (SEQ ID NO:161) 

2) GAA GAA GGA TCC GGC ACT TGG GGG TGT AGA ATG AAC ACC CTC 
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CAA GCT GAG CTT GCA CAG GAT TTC GTG GAC AGT AGA CAT AGT 
ACT ATA AGG TTG CTC (SEQ ID NO:162) 

3) TCA TCA TCC GGA AGA AGG (SEQ ID NO:158) 

4) GAA GAA GGA TCC GGC ACT (SEQ ID NO: 156) 

5 

The sequence of the resulting biosensors is shown in SEQ ID NO:13-14 
(Caspase 6) and SEQ ID NO: 15-16 (Caspase 8). Furthermore, multiple copies of the 
protease recognition sites can be inserted into the biosensor, yielding the biosensors 
shown in SEQ ID NO: 17-18 (Caspase 3) and SEQ ID NO:19-20 (Caspase 8). 

10 

2. Caspase 3 biosensor with a second signal domain 

An alternative embodiment employs a second signal domain operatively 
linked to the reactant target domain. In this example, full length MAP4 serves as the 
reactant target sequence. Upon recognition and cleavage, one product of the reaction, 

15 containing the reactant target sequence, remains bound to microtubules in the 
cytoplasm with its own unique signal, while the other product, containing the product 
target sequence, diffuses into the nucleus. This biosensor provides a means to measure 
two activities at once: caspase 3 activity using a translocation of GFP into the nucleus 
and microtubule cytoskeleton integrity in response to signaling cascades initiated 

20 during apoptosis, monitored by the MAP4 reactant target sequence. 

The basic premise for this biosensor is that the reactant is tethered to the 
microtubule cytoskeleton by virtue of the reactant target sequence comprising the full 
length microtubule associated protein MAP4 (SEQ ID NO: 152) (Figure 29C) In this 
case, a DEVD (SEQ ID NO:60) (Figure 29B) recognition motif is located between the 

25 EYFP signal (SEQ ID NO:44) (Figure 29A) operatively linked to the reactant target 
sequence, as well as the EBFP signal (SEQ ID NO:48) (Figure 29 A) operatively 
Imked to the C-terminus of MAP4. The resulting biosensor is shown in SEQ ID 
NO:21-22. 

This biosensor can also include a product targeting domain, such as an NLS, 
30 operatively linked to the signal domain. 

With this biosensor, caspase-3 cleavage still releases the N-terminal GFP, which 
undergoes translocation to the nucleus (directed there by the NLS). Also, the MAP4 
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fragment, which is still intact following proteolysis by caspase-3, continues to report on 
the integrity of the microtubule cytoskeleton during the process of apoptosis via the 
second GFP molecule fused to the C-terminus of the biosensor. Therefore, this single 
chimeric protein allows simultaneous analysis of caspase-3 activity and the 
5 polymerization state of the microtubule cytoskeleton during apoptosis induced by a 
variety of agents. This biosensor is also useful for analysis of potential drug candidates 
that specifically target the microtubule cytoskeleton, since one can determine whether a 
particular drug induced apoptosis in addition to affecting microtubules. 

This biosensor potentially combines a unique signal for the reactant, 

10 fluorescence resonance energy transfer (FRET) from signal 2 to signal 1, and a unique 
signal locahzation for the product, nuclear accumulation of signal 1. The amount of 
product generated will also be indicated by the magnitude of the loss in FRET, but this 
will be a smaller SiSIR than the combination of FRET detection of reactant and spatial 
localization of the product. 

15 FRET can occur when the emission spectrum of a donor overlaps significantly 

the absorption spectrum of an acceptor molecule, (dos Remedies, C.G., and P.D. 
Moens. 1995. Fluorescence resonance energy transfer spectroscopy is a reliable "ruler" 
for measuring structural changes in proteins. Dispelling the problem of the imknown 
orientation factor. J Struct Biol. 115:175-85; Emmanouilidou, E., A.G. Tesclaemachef, 

20 A.E. Pouli, L.L NichoUs, E.P. Seward, and G.A. Rutter. 1999. Imaging Ca(2+) 
concentration changes at the secretory vesicle surface with a recombinant targeted 
cameleon. CurrBiol. 9:915-918.) The average physical distance between the donor and 
acceptor molecules* should be between 1 nm and 10 nm with a preference of between 1 
ran and 6 nm. The intervening sequence length can vary considerably since the three 

25 . dimensional structure of the peptide will determine the physical distance between donor 
and acceptor. This FRET signal can be measured as (1) the amount of quenching of the 
donor in the presence of the acceptor, (2) the amount of acceptor emission when 
exciting the donor, and/or (3) the ratio between the donor and acceptor emission. 
Alternatively, fluorescent lifetimes of donor and acceptor could be measured. 

30 This case adds value to the above FRET biosensor by nature of the existence of 

the reactant targeting sequence. This sequence allows the placement of the biosensor 
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into specific compartments of the cell for a more direct readout of activity in those 
compartments such as the inner surface of the plasma membrane. 

The cytoplasmic second signal represents both original reactant plus one part of 
the product The nuclear first signal represents another product of the reaction. Thus the 

5 enzymatic reaction has the added flexibility in that it can be represented as (1) nuclear 
intensity; (2) the nucleus /cytoplasm ratio; (3) the nucleus /cytoplasm FRET ratio; (4) 
cytoplasmic /cytoplasmic FRET ratio. 

The present FRET biosensor design differs from previous FRET-based 
biosensors (see WO 97/28261; W09837226) in that it signal measurement is based on 

10 spatial position rather than intensity. The products of the reaction are segregated from 
the reactants. It is this change in spatial position that is measured. The FRET-based 
biosensor is based on the separation, but not to another compartment, of a donor and 
acceptor pair. The intensity change is due to the physical separation of the donor and 
acceptor upon proteolytic cleavage. The disadvantages of FRET-based biosensors are 

15 (1) the SNR is rather low and difBcult to measure, (2) the signal is not fixable. It must 
be recorded using living cells. Chemical fixation, for example with fonnaldehyde, 
cannot preserve both the parent and resultant signal; (3) the range of wavelengths are 
limiting and cover a larger range of the spectrum due to the presence of two 
fluorophores or a fluorophore and chromophore; (4) the construction has greater 

20 limitations in that the donor and acceptor must be precisely arranged to ensure that the 
distance falls within 1-10 nm. 

Benefits of the positional biosensor includes: (1) ability to concentrate the 
signal in order to achieve a higher SNR. (2) ability to be used with either Uving or fixed 
cells; (3) only a single fluorescent signal is needed; (4) the arrangement of the domains 

25 of the biosensor is more flexible. The only Umiting factor in the application of the 
positional biosensor is the need to define the spatial position of the signal which 
requires an imaging method with sufficient spatial resolution to resolve the difference 
between the reactant compartment and the product compartment- 
One of skill in the art will recognize that this approach can be adapted to report 

30 any desired combination of activities by simply making the appropriate substitutions 
for the protease recognition sequence and the reactant target sequence, including but 
not limited to those sequences shown in Figure 29A-C. 
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3. Caspase 8 biosensor with a nucleolar localization domain (CP8GFPNUC- 
CYTO) 

This approach (diagrammed in Figure 34) utilizes a biosensor for the detection 
5 of caspase-8 activity. In this biosensor, a nucleolar localization signal 
(RKRIRTYIXSCRRMKRSGFEMSRPIPSHLT) (SEQ ID NO:130) (Figure 29C) 
(Ueki et al., Biochem. Biophys. Res. Comm. 252:97-100, 1998) was used as the 
product target sequence, and made by PGR using the primers described below. The 
PGR product was digested with BspEl and Pvul and gel purified. The vector and the 
10 PGR product were ligated as described above. 

Primers for Caspase 8, Nucleolar localization signal (CP8GFPNUC-CYTO): 

1) TCA TCA TCC GGA AGA AAA CGT ATA CGT ACT TAG CTG AAG 
15 TCC TGC AGG GGG ATG AAA AGA (SEQIDNO:163) 

2) GAA GAA CGATCG AGT AAG GTG GGA AGG AAT AGG TCG AGA 
CAT CTG AAA ACC ACT TGT TTT CAT (SEQ ID NO:164) 

3) TCA TCA TCC GGA AGA AAA (SEQ ID NO: 165) 

4) GAA GAA GGA TCG AGT AAG (SEQ ID NO:166) 

20 The sequence of the resulting biosensor is shown in SEQ ID NO: 23-24. This 

biosensor includes the protease recognition site for caspase-8 (SEQ ID NO:74) 
(Figure 29B). A similar biosensor utilizes the protease recognition site for caspase-3. 
(SEQ ID NO:25-26) 

These biosensors could be used with other biosensors that possess the same 

25 product signal color that are targeted to separate compartments, such as CP3GFPNLS- 
GYTO. The products of each biosensor reaction can be uniquely measured due to 
separation of the products based on the product targeting sequences. Both products 
from CP8GFPNUC-CYT0 and GP3GFPNLS-CYT0 are separable due to the different 
spatial positions, nucleus vs. nucleolus, even though flie colors of the products are 

30 exactly the same. Assessing the non-nucleolar, nuclear region in ordeac to avoid the 
spatial overlap of the two signals would perform the measvu-ement of CP3GFPNLS in 
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the presence of CP8GFPNUC. The loss of the nucleolar region from the nuclear signal 
is insignificant and does not significantly affect the SNR. The principle of assessing 
multiple parameters using the same product color significantly expands the number of 
parameters that can be assessed simultaneously in living cells. This concept can be 

5 extended to other non-overlapping product target compartments. 

Measurement of translocation to the nucleolar compartment is performed by (1) 
defining a mask corresponding to the nucleolus based on a nucleolus-specific marker, 
including but not limited to an antibody to nucleolin (Lischwe et al., 1981. E:qy. Cell 
Res. 136:101-109); (2) defining a mask for the reactant target compartment, and (3) 

10 determining the relative distribution of the signal between these two compartments. 
This relative distribution could be represented by the difference in the two intensities 
or, preferably, the ratio of the intensities between compartments. 

The combination of multiple positional biosensors can be complicated if the 
reactant compartments are overlapping. Although each signal could be measured by 

15 simply determining the amount of signal in each product target compartment, higher 
SNR will be possible if each reactant is uniquely identified and quantitated. This higher 
SNR can be maximized by adding a second signal domain of contrasting fluorescent 
property. This second signal may be produced by a signal domain operatively linked to 
the product targeting sequence, or by FRET (see above), or by a reactant targeting 

20 sequence uniquely identifying it within the reactant compartment based on color, 
spatial position, or fluorescent property including but not limited to polarization or 
lifetime. Alternatively, for large compartments, such as the^cytoplasm, it is possible to 
place different, same colored biosensors in different parts of the same compartment. 

25 4. Protease biosensors with multiple copies of a second signal domain serving 
as a reactant target domain 

In another ' example, (CP8YFPNLS-SIZECFPn) increasing the size of the 
reactant is accomplished by using multiple inserts of a second signal sequence, for 
example, ECFP (SEQ ID NO:50) (Figure 29A) (Tsien, R-Y. 1998. Anna Rev 
30 Biochem. 67:509-44). Thus, the multiple copies of the second signal sequence serve as 
the reactant target domain by excluding the ability of the biosensor to dif&ise into the 
nucleus. This type of biosensor provides the added benefit of additional signal being 
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available per biosensor molecule. Aggregation of multiple fluorescent probes also can 
result in unique signals being manifested, such as FRET, self quenching, eximer 
formation, etc. This could provide a unique signal to the reactants. 

5 5. Tetanus/botulinum biosensor with trans-membrane targeting 

domain 

In an alternative embodiment, a trans-membrane targeting sequence is used to 
tether the reactant to cytoplasmic vesicles, and an alternative protease recognition site 
is used. The tetanus^otulinum biosensor (SEQ ID NOS:27-28 (cellubrevin); 29-30 

10 (synaptobrevin) consists of an NLS (SEQ ID NO:128) (Figure 29C), Fret25 ' signal 
domain (SEQ ID NO:52) (Figure 29 A), a tetanus or botulmum zinc metalloprotease 
recognition site from cellubrevin (SEQ ID NO:106) (Figure 29B) (McMahon et aL, 
Nature 364:346-349, 1993; Martin et aL, J. Cell Biol., in press) or synaptobrevm (SEQ 
ID NO:108) (Figure 29B) (GenBank Accession #U64520), and a trans-membrane 

15 sequence from cellubrevin (SEQ ID NO:146) (Figure 29C) or synaptobrevin (SEQ ID 
NO: 144) (Figure 29Q at the 3 '-end which tethers the biosensor to cellular vesicles. 
The N-terminus of each protein is oriented towards the cytoplasm. In the intact 
biosensor, GFP is tethered to the vesicles. Upon cleavage by the tetanus or botulinum 
zinc metalloproteas6, GFP will no longer be associated with the vesicle and is free to 

20 diffuse throughout the cytoplasm and the nucleus, 

b. 5-domain biosensors 

1. Caspase 3 biosensor with a nuclear localization domain and a 
second signal domain operatively linked to an annexin II domain 
25 The design of this biosensor is outlined in Figure 35, and the sequence 

is shown in SEQ ID NO:33-34. This biosensor differs from SEQ ID NO 11-12 by 
including a second detectable signal, ECFP (SEQ ID NO:50) (Figure 29A) (signal 2) 
operatively linked to the reactant target sequence. 

30 2. Caspase 3 biosensor with a nuclear localization sequence and a 

second signal domain operatively linked to a 1VIAP4 projection domain 
(CP3YFPNLS.CFPCYTO) 
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In this biosensor (SEQ ID NO:31-32), an NLS product targeting domain (SEQ 
ID NO: 128) (Figure 29C) is present upstream of an EYFP signal domain (SEQ ID 
NO:44) (Figure 29A). A DEVD protease recognition domain (SEQ ID NO:60) 
(Figure 29B) is between after the EYFP signal domain and before the MAP4 
5 projection domain (SEQ ID NO:142) (Figure 29C). 

Example 11. Fluorescent Biosensor Toxin Characterization 

As used herein, "toxin" refers to any organism, macromolecule, or organic or 
inorganic molecule or ion that alters normal physiological processes found Avifhin a 
10 cell, or any organism, macromolecule, or organic or inorganic molecule or ion that 
alters the physiological response to modulators of known physiological processes. 
Thus, a toxin can mimic a normal cell stimulus, or can alter a response to a normal cell 
stimulus. 

Living cells are the targets of toxic agents that can comprise organisms, 

15 macromolecules, or organic or inorganic molecules. A cell-based approach to toxin 
detection, classification, and identification would exploit the sensitive and specific 
molecular detection and amplification system developed by cells to sense minute 
changes in their external milieu. By combining the evolved sensing capability of cells 
with the luminescent reporter molecules and assays described herein, intracellular 

20 molecular and chemical events caused by toxic agents can be converted into detectable 
spatial and temporal luminescent signals. 

When a toxin interacts with a cell, whether it is at the cell surface or within a 
specific intracellular compartment, the toxin invariably undermines one or more 
components of the molecular pathways active within the cell. Because the cell is 

25 comprised of complex networks of interconnected molecular pathways, the effects of a 
toxin will likely be transmitted throughout many cellular pathways. Therefore, our 
strategy is to use molecular markers within key pathways likely to be affected by 
toxins, including but not Umited to cell stress pathways, metabolic pathways, signaling 
pathways, and growth and division pathways. 

30 We have developed and characterized three classes of cell based Imninescent 

reporter molecules to serve as reporters of toxic threat agents. These 3 classes are as 
follows; 
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(1) Detectors: general cell stress detection of a toxin; 

(2) Classifiers: perturbation of key molecular pathway(s) for detection and 
classification of a toxin; and 

(3) Identifiers: activity mediated detection and identification of a toxin or a 
5 group of toxins.- 

Thus, in another aspect of the present invention, living cells are used as 
biosensors to interrogate the environment for the presence of toxic agents. In one 
embodiment of this aspect, an automated method for cell based toxin characterization is 
disclosed that comprises providing an airay of locations containing cells to be treated 

10 with a test substance, wherein the cells possess at least a first luminescent reporter 
molecule comprising a detector and a second luminescent reporter molecule selected 
from the group consisting of a classifier or an identifier; contacting the cells with the 
test substance either before or after possession of the first and second lunmiescent 
reporter molecules by the cells; imaging or scanning multiple cells in each of the 

15 locations containing multiple cells to obtain luminescent signals fi-om the detector; 
converting the luminescent signals firom the detector into digital data to automatically 
measure changes in the localization, distribution, or activity of the detector on or in the 
cell, which indicates the presence of a toxin in the test substance; selectively imaging or 
scanning the locations containing cells that were contacted with test sample indicated to 

20 have a toxin in it to obtain luminescent signals fi-om the second reporter molecule; 
converting the luminescent signals firom the second luminescent reporter molecule into 
digital data to automatically measure changes in the localization, distribution, or 
activity of the classifier or identifier on or in the cell, wherein a change in the 
localization, distribution, structure or activity of the classifier identifies a cell pathway 

25 that is perturbed by the toxin present in the test substance, or wherein a change in the 
localization, distribution, structure or activity of the identifier identifies the specific 
toxin that is present in the test substance. In a preferred embodiment, the cells possess 
at least a detector, a classifier, and an identifier. In a fiirther preferred enibodiment, the 
digital data derived fi-om the classifier is used to determine which identifier(s) to 

30 employ for identifying the specific toxin or group of toxins. 

As used herein, the phrase "the cells possess one or more luminescent reporter 
molecules" means that the luminescent reporter molecule may be expressed as a 
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luminescent reporter molecule by the cells, added to tlie cells as a luminescent reporter 
molecule, or luminescently labeled by contacting the cell with a Imninescently labeled 
molecule that binds to the reporter molecule, such as a dye or antibody, that binds to the 
reporter molecule. The luminescent reporter molecule can be expressed or added to the 
5 cell either before or after treatment with the test substance. 

The ImninescOTt reporters comprising detectors, classifiers, and identifiers may 
also be distributed separately into single or multiple cell types. For example, one cell 
type may contain a toxin detector, which, when activated by toxic activity, implies to 
the user that the same toxin sample should be screened with reporters of the classifier 

10 or identifier type in yet another population of cells identical to or different firom the 
cells containing the toxin detector. 

The detector, classifier, and identifier can comprise the same reporter molecule, 
or they can comprise different reporters. 

Screening for changes in the localization, distribution, structure or activity of 

15 the detectors, classifiers, and/or identifiers can be carried out in either a high 
throughput or a high content mode. In general, a high-content assay can be converted 
to a high-throughput assay if the spatial information rendered by the high-content assay 
can be recoded in such a way as to no longer require optical spatial resolution on the 
cellular or subcellular levels. For example, a high-content assay for microtubule 

20 reorganization can be carried out by optically resolving luminescently labeled cellular 
microtubules and measuring their morphology (e.g„ bundled vs. non-bundled or 
normal). A high-throughput version of a microtubule reorganization assay would 
involve only a measurement of total microtubule poljmer mass after cellular extraction 
with a detergent That is, destabilized microtubules, being more easily extracted, would 

25 . result in a lower total microtubule mass luminescence signal than unperturbed or drug- 
stabilized luminescently labeled microtubules in another treated cell population. The 
luminescent signal emanating firom a domain containing one or more cells will 
therefore be proportional to the total microtubule mass remaining in the cells afl;er toxin 
treatment and detergent extraction. 

30 The methods for detecting, classifying, and identifying toxins can utilize the 

same screening methods described throughout the instant application, including but not 
limited to detecting changes in cytoplasm to nucleus translocation, nucleus or nucleolus 
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to cj^oplasm translocation, receptor internalization, mitochondrial membrane potential, 
signal intensity, the spectral response of the reporter molecule, phosphorylation, 
intracellular free ion concentration, cell size, ceU shape, cytoskeleton organization, 
metabolic processes, cell 'motility, cell substrate attachment, cell cycle events, and 

5 organellar structure and function. 

In all of these embodiments, the methods can be operated in both toxin-mimetic 
and toxin-inhibitory modes. 

Such techniques to assess the presence of toxins are useful for methods 
including, but not limited to, monitoring the presence of environmental toxins in test 

10 samples and for toxins utilized in chemical and biological weapons; and for detecting 
the presence and characteristics of toxins during environmental remediation, drag 
discovery, clinical applications, and during the normal development and manufacturing 
process by virtually any type of industry, includmg but not Umited to agriculture, food 
processing, automobile, electronic, textile, medical device, and petroleum industries. 

15 We have developed and characterized examples of luminescent cell-based 

reporters, distributed across the 3 sensor classes. The methods disclosed herein can be 
utilized in conjunction with computer databases, and data management, mining, 
retrieval, and display methods to extract meanirigful patterns from the enormous data 
set generated by each individual reporter or a combinatorial of reporters m response to 

20 - toxic agents. Such databases and bioinformatics methods include, but are not limited 
to, those disclosed in U.S. Patent Application Nos. 09/437,976, filed November 10, 
1999; 60/145,770 filed July 27, 1999 and U.S. Patent Application Serial No. to be 
assigned, filed Febraary 19, 2000. (98,068-C) 

Any cell type can be used to carry out this aspect of the invention, including 

25 prokaryotes such as bacteria and archaebacteria, and eukaryotes, such as single celled 
fungi (for example, yeast), molds (for example, DictyosteUum), and protozoa (for 
example, Euglena). Higher eukaryotes, including, but not limited to, avian, amphibian, 
insect, and mammalian cells can also be used. 
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Number I Name I Class 1 Cell Types I Response to model toxins 
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Positive Negative 


1 


Mitochondrial 
Potential 

[Donnan Equilibrium 
Dycl 


D 


• LLCPK (pig epithelia) 

• Rat primary hepatocytes 


Valinomycin Oligomycin 

(lOnM-lODMM) (10 nM) 

FCCP 

(lOnM-lOOpM) 


2 


Heat Shock Protein 
(Hsp 27, Hsp 70) 
GFP-chimera 


D 


• HeLa 

• 3T3 


Cadmiirai TNF-a 
(lOmM) (lOOng/^ml) 


3 


Tubulin- 
cytoskeleton 
[P-tubulin-GFP 
chimera] 


C 


• BHK 

• HeLa 

• LLCPK 


Paclitaxel Staurosporine 

(10 nM-20nM) (1 nM-1 ^iM) 

C7uracin-A 

(5 nM-lOfiM) 

Nocadazole 

(7 nM-12nM) 

Colchicine 

(5 nM-IOfiM) 

Vinblastine 

f5 nM-lOuM) 


4 


pp38 MAPK- stress 
signaUng 

[antibody and GFP- 
chimera] 


C 


• 3T3 

• LLCPK 


Anisomycin TNF-a 

(100 pM) (lOOng/ml) 

Cadmimn 

(10 MM) 


5 


NF-kB- stress 
signaling 

[antibody and GFP- 
chimera] 


C 


• HeLa 

• 3T3 

• BHK 

• SNB19 

• HepG2 

• LLCPK 


TNF-a Anisomycin 

(100ng/nil-O.38pg/inl) (10 nM-10 pM) 
IL-1 Cadmium 
(4ng/inl-.095pg/ml) (1-1 0 ^M) 
Nisin Penitrem A 

(2-1000 iig/nO) (10 (iM) 

Streptolysin Valinomycin 

(lOpg/ml) (IpM) 

Anisomycin 

(100 MM) 


6 


IkB 

[complement to NF- 
kBI 


c 


In many cell types 




1 


Tetanus Toxin 
[Protease activity- 
based sensor] 


I 


In many cell types 




8 


Anthrax UF 
[Protease activity- 
based sensor] 


I 


In many cell types 





Sensor Class: D= Detector of toxins; D= Classifier of toxins; I== Identifier of toxin or group of toxins 
The model toxins can generally be purchased firom Sigma Chemical Company (St Louis, MO) 



5 Examples of Detectors: This class of sensors provides a first line signal that 
indicates the presence of a toxic agent This class of sensors provides detection of 
general cellular stress that requires resolution limited only to the domain over which the 
measurement is being made, and they are amenable to high content screens as well. 
Thus, either high throughput or high content screening modes may be used, including 

10 but not limited to translocation of heat shock factors from the cj^oplasm to the nucleus. 
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and changes in mitochondrial membrane potential, intracellular free ion concentration 
detection (for example, Ca^***; HT), general metabolic status, cell cycle timing events, 
and organellar structure and function. 

5 L Mitochondrial Potential 

A key to maintenance of cellular homeostasis is a constant ATP energy charge. 
The cycling of ATP and its metabolites ADP, AMP, inorganic phosphate, and solution- 
phase protons is continuously adjusted to meet the catabolic and anabolic needs of the 
celL Mitochondria are primarily responsible for maintaining a constant energy charge 

10 throughout the entire cell. To produce ATP from its constituents, mitochondria must 
maintain a constant membrane potential within the organelle itself. Therefore, 
measurement of this electrical potential with specific luminescent probes provides a 
sensitive and rapid readout of cellular stress. 

We have utilized a positively charged cyanine dye, JC-1 (Molecular Probes, 

15 Eugene, OR), which diffuses into the cell and readily partitions into the mitochondrial 
membrane, for measurement of mitochondrial potential. The photophysics of JC-1 are 
such that when the probe partitions into the mitochondrial membrane and it experiences 
an electrical potential >140 mV, the probe aggregates and its spectral response is 
shifted to' the red. At membrane potential values <140 mV, JC-1 is primarily 

20 monomeric and its spectral response is shifted toward the blue. Therefore, the ratio of 
two emission wavelengths (645 nm and 530 nm) of JC-1 partitioned into mitochondria 
provides a sensitive and continuous measure of mitochondrial membrane potential. 

We have been making live cell measurements in a high throughput mode as the 
basis of a generalized indicator of toxic stress. The goal of o\ir initial experiments was 

25 to determine the ratio of J-aggregates of JC-1 dye to its monomeric form both before 
and after toxic stress. 
Procedure 

1 . Cells were plated and cultured up to overnight. 

2. Cells were stained with JC-1 (10 iig/ml) for 30 nainutes at 37^ C in a CO2 incubator. 
30 3. Cells were then washed quickly with HBSS at 37°C (2 times, 150 ^1/well), the 

toxins were added if required, and the entire plate was scanned in a plate reader. 
The JC-1 monomer was measured optimally with a 485 nm excitation/530 nm 
emission wavelength filter set, and the aggregates were best measured with a 590 
nm excitation/645 nm emission wavelength set. 

35 
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Results 

The mitochondrial potential within several types of living cells, and the effects 
of toxins on the potential were measured using the fluorescence ratio Em 645 (590)/ 

5 Em 530 (485) (excitation wavelengths in parentheses). For example, we measured the 
effect of 10 |iM valinomycin on the mitochondrial potential within LLCPK cells (pig 
epithelia). Within seconds of treatment, the toxin induced a more rapid and higher 
magnitude decrease (an approximately 50% reduction) in mitochondrial potential than 
that found in untreated cells. Hepatocytes were also determined to be sensitive to 

10 valinomycin, and the changes in mitochondrial potential were nearly complete within 
seconds to minutes after addition of various concentrations of the toxin- 

These results are consistent with mitochondrial potential being a model 
intracellular detector of cell stress. Because these measm-ements require no spatial 
resolution within . individual cells, mitochondrial potential measurements can be made 

15 rapidly on an entire cell array (e.g. high throughput). This means, for example, that 
complex arrays of many cell types can be probed simultaneously and continuously as a 
generalized toxic response. Such an indicator can provide a first line signal to indicate 
that a general toxic, stress is present in a sample. Further assays can then be conducted 
to more specifically identify the toxin using cells classifier or identifier type reporter 

20 molecules. 

2. Heat Shock Proteins 

Most mammalian cells will respond to a variety of environmental stimuli with 
the induction of a family of proteins called stress proteins. Anoxia, amino acid 

25 analogues, sulfhydryl-reacting reagents, transition metal ions, decouplers of oxidative 
phosphorylation, viral infections, ethanol, antibiotics, ionophores, non-steroidal 
antiinflammatory drugs, thermal stress and metal chelators are all inducers of cell stress 
protein synthesis, fimction, or both. Upon induction, cell stress proteins play a role in 
folding and unfolding proteins, stabilizing proteins in abnormal configurations, and 

30 repairing DNA damage. 

There is evidence that at least four heat shock proteins translocate fi-om the 
cytoplasm to tlie nucleus upon stress activation of the cell. These proteins include the 
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heat shock proteins HSP27 and HSP70, the heat shock cognate HSC70, and the heat 
shock transcription factor HSFL Therefore, measurement of cytoplasm to nuclear 
translocation of these proteins (and other stress proteins that translocate firom the 
cytoplasm to the nucleus upon a cell stress) will provide a rapid readout of cellular 
5 stress. 

We have tested the response of an HSP27-GFP biosensor (SEQ ID 159-170) m 
two cell Unes (BHK and HeLa) using a library of heavy metal chemical compounds as 
biological toxin stimulants to stress the cells. Briefly, cells expressing the HSP27-GFP 
biosensor are plated into 96-well microplates, and allowed to attach. The cells are then 
10 treated with a panel of cell stress-inducing compounds- Exclusively cytoplasmic 
localization of the fusion protem was found in imstimulated cells. 

Other similar heat shock protein biosensors (HSP-70, HSC70, and HSFl fused 
to GFP) can be used as detectors, and are shown in SEQ ID NO: 171-176. 

15 

Examples of Classifiers: 

This class of sensors detects the presence o^ and further classifies toxins by 
identifying the cellular pathway (s) perturbed by the toxin. As such, this suite of sensors 
can detect and/or classify toxins into broad categories, including but not limited to 

20 "toxins affecting signal transduction," "toxins affecting the cytoskeleton," and "toxins 
affecting protein synthesis". Either high throughput or high content screening modes 
may be used. Classifiers can comprise compounds mcluding but not limited to tubulm, 
microtubule-associated proteins, actm, actin-binding proteins including but not limited 
to vinculin, a-actinin, actin depolymerizing factor/cofilin, profilin, and myosin; NF-kB, 

25 IkB, GTP-binding proteins includmg but not limited to rac, rho, and cdc42, and stress- 
activated protein kinases includmg but not lunited to p38 mitogen-activated protein 
kinase. 

L Tuhulin-cvtoskeleton 
30 The cell cytoskeleton plays a major role in cellular functions and processes, 

such as endo- and- exocytosis, vesicle transport, and mitosis. Cytoskeleton-affecting 
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toxins, of proteinaceous and non-proteinaceous form, such as C2 toxin, and several 
classes of enterotoxins, act either directly on the cytoskeleton, or indirectly via 
regulatory components controlling the organization of the cytoskeleton. Therefore, 
measurement of structural changes in the cytoskeleton can provide classification of the 

5 toxin into a class of cyto skeleton-affecting toxins. This assay can be conducted in a 
high content mode, as described previously, or in a high throughput mode. For high 
throughput as discussed previously. 

Such measurements will be valuable for identification of toxins including, but 
not limited to anti-microtubule agents, agents that generally affect cell cycle 

0 progression and cell proliferation, intracellular signal transduction, and metabolic 
processes. 

For microtubule disruption assays, LLCPK cells stably transfected with a 
tubulin-GFP biosensor plasmid were plated on 96 well cell culture dishes at 50-60% 
confluence and cultured overnight at 37 "^C, 5% CO2. A series of concentrations (10- 

5 500 nM) of 5 compounds (paclitaxel, curacin A, nocodazole, vinblastine, and 
colchicine) m normal culture media were fi-eshly prepared fi-om stock, and were added 
to cell culture dishes to rq3lace the old culture media. The cells were then observed 
with the cell screening system described above, at a 12 hour time point. 

Our data indicate that the tubulin chimera localizes to and assembles into 

0 microtubules throughout the cell. The microtubule arrays in cells expressing the 

chimera respond as follows to a variety of anti-microtubule compounds: 

Drug Response 

Vinblastine Destabilization 

Nocodazole DestabiUzation 

5 Pachtaxel Stabilization 

Colchicine Destabilization 

Curacin A Destabilization 

Similar data were obtained using cells expressing the tubulin biosensor that 
0 were patterned onto cell arrays (such as those described in U.S. Patent Application 
Serial No. 08/865,341 filed May 29, 1997, incorporated by reference herein in its 
entirety) and dosed as above. 
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2. NF'JcB 

NF-kB is cytoplasmic at basal levels of stimulation, but upon insult translocates 
to the nucleus where it binds specific DNA response elements and activates 
transcription of a number of genes. Translocation occurs when IkB is degraded by the 
5 proteosome in response to specific phosphorylation and ubiquitination events. HcB 
nomially retains NF-kB in the cytoplasm via direct interaction with the protein, and 
masking of the NLS sequence of NF-kB. Therefore, although not the initial or defining 
event of the whole signal cascade, NF-kB translocation to the nucleus can serve as an 
indicator of cell stress. 

10 We have generated an NF-kB-GFP chimera for analysis in live cells. This was 

accomplished using standard polymerase chain reaction techniques using a 
characterized NF-kB p65 cDNA purchased firom Invitrogen (Carlsbad, CA) fused to an 
EYFP pgr amplimer that was obtained fi-om Clontech Laboratories (Palo Alto, CA). 
The resulting chimera is shown in SEQ ID NO:177-178. The two PGR products were 

15 ligated into an eukaryotic expression vector designed to produce the chimeric protein at 
high levels using the ubiquitous CMV promoter. 

NF-kB immiinolocalization 

20 For further studies, we characterized endogenous NF-kB activation by 

immunolocalization in toxin treated cells. The NF-kB antibodies used in this study 
were purchased firom Santa Cruz Biotechnology, Inc. (Santa Cruz, CA), and secondary 
antibodies are from Molecular Probes (Eugene, OR). 

For the 3T3 and SNB19 cell types, we determined the effective concentrations 

25 that yield response levels of 50% of the maximum (EC50), expressed in units of mass 
per volume (ng/ml) and units of molarity. Based on molecular weights of 17 kD for 
both TNFa and IL-la, the EC50 levels for these two compounds with 3T3 and SNB19 
cell types are given in units of molarity in Table 1. Our results demonstrated 
reproducibility of the relative responses jfrona zero to maximum dose, but from sample 

30 to sample there have been occasional shifts in the baseline intensities of the response at 
zero concentration. 
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For these experiments, either 10 or 100 TNFa-treated 3T3 or SNB19 cells/well 
were tested. On the basis of the standard deviations measured for these samples, and 
by taking t-values for the student's t-test, we h^ve estimated the minimum detectable 
doses for each case of cell type, compound, number of cells per well, and for different 

5 choices of how many wells are sampled per condition. The latter factor detenrdnes the 
nxnnber of degrees of freedom that are provided in the sample of data, Licreasing the 
number of wells from 4 to 16, and mcreasing the number of cells per well from 10 to 
100, improves the minimum detectable doses considerably- For 3T3 cells, which show 
lower minimum detectable doses than the SNB19 cells, and for the case of 1% false 

10 negative and 1% false positive rates, we estimate that 100 cells per well and a sampling 
of 12 or 16 wells are sufficient to detect a dose approximately equal to the EC50 value 
of 0.15 ng/ml. If the false positive rate is relaxed to 20%, a concentration of 
approximately half that value can be detected (0.83 ng/ml). One hundred cells can 
conveniently be sampled over a cell culture surface area of less than 1 mm . 

15 

Table 1 . EC50 levels for TNFa and IL-1 a (based on molecular weights of 17 kD for 
both) 



Compoimd 


Cell Type 


EC50 f 10 moles/liter) 










3T3 


8.8 




SNB19 


5.9 








IL-la 


3T3 


0.24 




SNB19 


59 



3. Phospho-pSS Mitogen Activated Protein Kinase (pp38MAPK) 

MAPKs play a role in not only cell growth and division, but as mediators of 
cellular stress responses. One MAPIC, p38, is activated by chemical stress inducers 
such as hyper-osmolar sorbitol, hydrogen peroxide, arsenite, cadmium ions, 
25 anisomycin, sodiiun salicylate, and LPS. Activation of p38 is also accompanied by its 
translocation into the nucleus from the cytoplasm. 
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MAPK p38 lies in a pathway that is a cascade of kinases. Thus, p38 is a 
substrate of one or more kinases, and it acts to phosphorylate one or more substrates in 
time and space within the living cell. 

The assay we present here measures, as one of its parameters, p38 activation 

5 using immunolocalization of the phosphorylated form of p38 in toxin-treated cells. The 
assay was developed to be flexible enough to include the simultaneous measurement of 
other parameters within the same individual cells. Because the signal transduction 
pathway mediated by the transcription factor NF-kB is also known to be involved in the 
cell stress response, we included the activation of NF-kB as a second parameter in the 

10 same assay. 

Our experiments demonstrate an immunofluorescence approach can be used to 
measure p38 MAPK activation either alone or in combination with NF-kB activation in 
the same cells. Multiple cell types, model toxins, and antibodies were tested, and 
significant stimulation of both pathways was measured in a high-content mode. The 

15 phospho-p38 antibodies used in this study were purchased from Sigma Chemical 
Company (St. Louis, MO). We report that at least two cell stress signaling pathways 
can not only be measured simultaneously, but are differentially responsive to classes of 
model toxins. Figure 36 shows the differential response of the p38 MAPK and NF-kB 
pathways across three model toxins and two different cell types. Note that when added 

20 alone, three of the model toxins (ILla, TNFa and Anisomycin) can be differentiated 
by the two assays as activators of specific pathways. 

Tk!B chimera 

IkB degradation is the key event leadmg to nuclear translocation of NF-kB and 
25 activation of the NFkB-mediated stress response. We have chosen this sensor to 
complement the NF-kB sensor as a classifier in a high-throughput mode: the 
measurement of loss of signal due* to degradation of the IkB-GFP fusion protein 
requires no spatial resolution within individual cells, and as such we envision HcB 
degradation measurements being made rapidly on an entire cell substrate, 
30 This biosensor is based on fusion of the first 60 amino acids of DcB to the 

Fred25 variant of GFP. SEQ ID 179-180 This region of IkB contains all the regulatory 
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sequences, including phosphorylation sites and ubiquitination sites, necessary to confer 
proteosome degradation upon the biosensor. Knowing this, stimulation of any pathway 
that would typically lead to NFkB translocation results in degradation of this biosensor. 
Monitoring the fluorescence intensity of cells expressing IlcB-GFP identifies the 
5 degradation process. 



Examples of Identifiers: 

In our toxin identification strategy, the first two levels of characterization enstire 

10 a rapid readout of toxin class without sacrificing the ability to detect many new mutant 
toxins or dissect several complex mixtures of known toxins. The third level of 
biosensors are identifiers, which can identify a specific toxin or group of toxins. In one 
embodiment, an identifier comprises a protease biosensor that responds to the activity 
of a specific toxin. Other identifiers are produced with reporters/biosensors specifio to 

15 their activities. These include, but are not limited to post-translational modifications 
such as phosphorylation or ADP-ribosylation, translocation between cellular organelles 
or compartments, effects on specific organelles or cellular components (for example, 
membrane permeabilization, cytoskeleton rearrangem^it, etc.) 

ADP-ribosvlating toxins — These toxins include Pseudomonas toxin A, diptheria 

20 toxin, botulinum toxin, pertussis toxin, and cholera toxin. For example, C. botulinum 
C2 toxiri induces the ADP-ribosylation of Argl77 in the cytoskeletal protein actin, thus 
altering its assembly properties. Besides the construction of a classifier assay to 
measure actin-cytoskeleton regulation, an identifier assay can be constructed to detect 
the specific actin ADP-ribosylation. Because the ADP-ribosylation induces a 

25 conformational change that no longer permits the modified actin to polymerize, this 
conformational change can be detected intracellularly in several possible ways using 
luminescent reagents. For example, actin can be luminescently labeled using a 
fluorescent reagent with an appropriate excited state lifetime that allows for the 
measurement of the rotational diffusion of the intracellular actin using steady state 

30 fluorescence anisotropy. That is, toxin-modified actin will no longer be able to 
assemble into rigid filaments and will therefore produce only luminescent signals with 
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relatively low airisotropy, which can be readily measured with an imaging system. In 
another embodiment, actin can be labeled with a polarity-sensitive fluorescent reagent 
that reports changes in actin-conformation through spectral shifts of the attached 
reagent. That is, toxin-treatment will induce a conformational change in intracellular 
5 actin such that a ratio of two fluorescence wavelengths will provide a measure of actin 
ADP-ribosylatioiL 

nvtotoxic pVinsnholipases - Several gram-positive bacterial species produce 
cytotoxic phosphoUpasBS. For example, Clostridium perfringens produces a 
phosphoUpase C specific for the cleavage of phosphoinositides. These 

10 phosphoinositides (B.g., inositol 1,4.5-trisphosphate) induce the release of calcium ions 
from intracellular organelles. An assay that can be conducted as either high-content or 
high-throughput can be constructed to measure the release of calcium ions using 
fluorescent reagents that have altered spectral properties when complexed with the 
metal ion. Therefore, a direct consequence of the action of a phospholipase C based 

15 toxin can be measured as a change m cellular calcimn ion concentration. 

F.vfoliative toxins - These toxins are produced by several Staphylococcal 
species and can consist of several serotypes. A specific identifier for these toxins can be 
constructed by measuring the morphological changes in their target organelle, the 
^desmosome, which occur at the junctions between cells. / The exfoliative toxins are 

20 known to change the morphology of the desmosomes into two smaller components 
called hemidesmosomes. In the high-content assay for exfoliative toxins, epithelial cells 
whose desmosomes are luminescently labeled are subjected to image analysis. An 
method that detects the morphological change between desmosomes and 
hemidesmosomes is used to quantify the activity of the toxins on the cells. 

25 Most of these identifiers can be used in high throughput assays requiring no 

spatial resolution, as well as in high content assays. 

Several biological threat agents act as specific proteases, and thus we have 
focused on the development of fluorescent protein biosensors that report the proteolytic 
cleavage of specific amino acid sequences found within the target proteins. 

30 A number of such protease biosensors (including FRET biosensors) are 

disclosed above, such as the caspase biosensors, anthrax, tetanus, Botulinum, and the 
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zinc metalloproteases. FRET is a powerful technique in that small changes in protein 
conformation, many of which are associated with toxin activity, can not only be 
measured with high precision in time and space within living cells, but can be measured 
in a high-throughput mode, as discussed above. 

5 As described above, one of skill in the art will recognize that the protease 

biosensors of this aspect of the invention can be adapted to report the activity of any 
protease, by a substitution of the ^propriate protease recognition site in any of the 
constructs (see Figure 29B). As disclosed above, these biosensors can be used in high- 
content or high throughput screens to detect in vivo iactivation of enzymatic activity by 
10 toxins, and to identify specific activity based on cleavage of a known recognition motif. 
These biosensors can be used in both Uve cell and fixed end-point assays, and can be 
combined with additional measurements to provide a multi-parameter assay. 

Anthrax LF 

15 Anthrax is a well-known agent of biological warfare and is an excellent target 

for development of a biosensor in the identifier class. Lethal factor (LF) is one of the 
protein components that confer toxicity to anthrax, and recently two of its targets within 
cdlls were identified. LF is a metalloprotease that specifically cleaves Mekl and Mek2 
proteins, kinases that are part of the MAP-klnase signaling pathway. Construction of 

20 lethal factor protease biosensors are described above. (SEQ ID NO:7-8; 9-10) Green 
fluorescent protein (GFP) is fiised in-frame at the amino tenninus of either Mekl or 
Mek2 (or both), resulting in a chimeric protein that is retained in the cytoplasm due to 
the presence of a nuclear export sequence (NES) present in both of the target 
molecules. Upon cleavage by active lethal factor, GFP is released from the chimera and 

25 is free to diffuse into the nucleus. Therefore, measming the accumulation of GFP in the 
nucleus provides a direct measure of LF activity on its natural target, the living cell. 

While a preferred form of the invention has been shown in the drawings and 
described, since variations in the preferred form will be apparent to those skilled in the 
art, the invention should not be construed as limited to the specific form shown and 
30 described, but instead is as set forth in the claims. 
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CLAIMS 

We claim: 

1 , An automated method for cell based toxin characterization comprising 

-providing an array of locations containing cells to be treated with a test 
5 substance, wherein the cells possess at least a first luminescent reporter molecule 
comprising a detector and a second luminescent reporter molecule selected from the 
group consisting of a classifier or an identifier, 

-contacting the cells with the test substance either before or after possession of 
the first and second luminescent reporter molecules by the cells; wherein the 
10 locaUzation. distribution, stracture, or activity of the first and second lumiriescent 
reporter molecule is modified when the cell is contacted with the toxin, 

-imaging or scanning multiple cells in each of the locations containing multiple 
cells to obtain luminescent signals firom the detector, 

-converting the luminescent signals fix)m the detector into digital data; 
15 -utilizing the digital data firom the detector to automatically measure the 

localization, distribution, or activity of the detector on or in the cell, wherein a change 
in the localization, distribution, structure or activity of the detector indicates the 
presence of a toxin va. the test substance; 

-selectively, imaging or scanning the locations containing cells that were 
20 contacted with test sample indicated to have a toxin in it to obtain luminescent signals 
fi-om the second reporter molecule; 

-converting the luminescent signals fixim the second luminescent reporter 

molecule into digital data; 

-utilizing the digital data fiiom the second luminescent reporter molecule to 

25 automatically measure the localization, distribution, or activity of the classifier or 
identifier on or in the ceU, wherein a change in the localization, distribution, structure 
or activity of the classifier identifies a cell pathway that is perturbed by the toxin 
present in the test substance, or wherein a change in the localization, distribution, 
structure or activity of the identifier identifies the specific toxin or group of toxins that 

30 are present in the test substance. 
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2. The method of claim 1 wherein the second limiinescent reporter molecule is a 
classifier, and the digital data derived from the classijBer is used to select an appropriate 
identifier for identification of the specific toxin or group of toxins. 

5 3. An automated method for cell based toxin characterization comprising 

-providing an array of locations containing cells to be treated with a test 
substance, wherein the cells possess at least a first luminescent reporter molecule 
comprising a detector, a second luminescent reporter molecule conaprising a classifier, 
and a third luminescent reporter molecule comprising an identifier; 
10 -contacting the cells with the test substance either before or after possession of 

the first second, and third luminescent reporter molecules by the cells; wherein the 
localization, distribution, structure, or activity of the first, second, and third luminescent 
reporter molecules is modified when the cell is contacted with the toxin, 

-imaging or scanning multiple cells in each of the locations containing multiple 
15 cells to obtain luminescent signals from the detector; 

-converting the luminescent signals from the detector into digital data; 
-utilizing the digital data from the detector to automatically measure the 
locaUzation, distribution, or activity of the detector on or in the cell, wherein a change 
in the localization, distribution, structure or activity of the detector indicates the 
20 presence of a toxin in the test substance; 

-selectively imaging or scanning the locations conta in i n g cells that were 
contacted with test. sample indicated to have a toxin in it to obtain luminescent signals 
from the classifier; 

-converting the luminescent signals from the classifier into digital data; 
25 -utilizing the digital data from the classifier to automatically measure the 

locaUzation, distribution, or activity of the classifier on or in the cell, wherein a change 
in the localization, distribution, structure or activity of the classifier identifies a cell 
pathway that is perturbed by the toxin present in the test substance; 

—selectively imaging or scanning the locations containing cells that were 
30 contacted with test sample indicated to have a toxin in it to obtain luminescent signals 
from the identifier, 

-converting the luminescent signals from the identifier into digital data; and 
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-Utilizing the digital data from the identifier to automatically measure the 
localization, distribution, or activity of the identifier on or in the cell, wherein a change 
in the localization, distribution, structure or activity of the identifier identifies the 
specific toxin or group of toxins that is present in the test substance. 

5 

4, The method of claim 3 wherein the digital data derived from the classifier is 
used to select an appropriate identifier for identification of the specific toxin or group 
of toxins. 

10 5. The method of any one of claim 1-4 wherein the detector comprises a molecule 
selected from the group consisting of heat shock proteins and compounds that respond 
to changes in mitochondrial membrane potential, intracellular free ion concentration, 
cytoskeletal organization, general metabolic status, cell cycle timing events, and 
organellar stmcture and function. 

15 

6. The method of any one of claim 1-5 wherein the classifier comprises a molecule 
selected from the group consisting of tubulin, microtubule-associated proteins, actin, 
actin-binding proteins, NF-kB, IkB, and stress-activated kinases. 

20 7. The method of any one of claim 1-6 wherein the cell pathway is selected from 
the group consisting of cell stress pathways, cell metabolic pathways, cell signaling 
pathways, cell growth pathways, and cell division pathways. 

8. The method of claim 1, wherein the second luminescent reporter molecule is an 
25 identifier, and the identifier identifies a toxin or group of toxins selected from the group 

consisting of proteases, ADP-ribosylating toxins, cytotoxic phosphoUpases, and 
exfoliative toxins. 

9. The method of any one of claim 3-7, wherein the identifier identifies a toxin or 
30 group of toxins selected from the group consisting of proteases, ADP-ribosylating 

toxins, cytotoxic phospholipases, and exfoliative toxins. 
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10, The method of any of claims 1-9 wherein the change in the localization, 
distribution, structure or activity of the first, second, or third luminescent reporter 
molecules is selected firom the group consisting of cytoplasm to nucleus translocation, 
nucleus or nucleolus to cytoplasm translocation, receptor internalization, mitochondrial 
5 membrane potential, loss of signal, the spectral response of the reporter molecule, 
phosphorylation, intracellular firee ion concentration, cell size, cell shape, cytoskeleton 
organization, metabolic processes, cell motility, cell substrate attachment, cell cycle 
events, and organellar structure and function. 

10 11. The method of any one of claims 1-10, wherein the imaging or scanning 
multiple cells in each of the locations containing multiple cells to obtain luminescent 
signals firom the detector is carried out in a high throughput mode, 

12. The method of any one of claims 1-10, wherein the imaging or scanning 
15 multiple cells in each of the locations containing multiple cells to obtain luminescent 

signals from the detector is carried out in a higji content mode. 

13. The method of claim 1-10 wherein the selective imaging or sca nnin g of the 
locations containing cells'^that were contacted with test sample indicated to have a toxin 

20 in it to obtain luminescent signals firom the second or third reporter molecule is carried 
out in a high throughput mode, 

14. The method of claim 1-10 wherein the selective imaging or scanning of the 
locations containing cells that were contacted with test sample indicated to have a toxin 

25 in it to obtain luminescent signals firom the second or third reporter molecule is carried 
out in a high content mode. 

15. The method of any one of claims 1-14 fiirtheir comprising providing a digital 
storage media for data storage and archiving. 

30 

16. The method of claim 15 further comprising a means for automated control, 
acquisition, processing and display of results. 
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17. A computer readable storage medium comprising a program containing a set of 
instructions for causing a cell screening system to execute the method of any one of 
claims 1-16, wherein the cell screening system comprises an optical system with a stage 
5 adapted for holding a plate containing cells, a means for moving the stage or the optical 
system, a digital camera, a means for directing light emitted from the cells to the digital 
camera, and a computer means for receiving and processing the digital data from the 
digital camera. 

10 18. A kit for cell based toxin detection comprising: 

(a) at least one reporter molecule, wherein the localization, distribution, 
structure, or activity of the reporter molecule is modified when the cell is contacted 

. with a toxin; 

(b) instmctions for using the reporter molecule to carry out the method of 
15 any one of claims 1-16 to detect toxins in a test substance. 

19. The kit of claim 18 fiirther comprising the computer readable storage medium 
of claim 17. 

20 20. An automated method for cell based toxin characterization comprising 

-providing a first array of locations containing cells to be treated with a test 
substance, wherein the cells possess a least a first luminescent reporter molecule 
comprising a reporter molecule selected from the group consisting of detectors and 
classifiers; 

25 -contacting the cells with the test substance either before or after possession of 

the first luminescent reporter molecule by the cells; wherein the localization, 
distribution, structure, or activity of the first luminescent reporter molecule is modified 
when the cell is contacted with the toxin, 

-imaging or scanning multiple cells in each of the locations containing multiple 
30 cells to obtain luminescent signals from the detector; 

-converting the luminescent signals from the detector into digital data; 
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-utilizing the digital data from the detector to automatically measure the 
localization, distribution, or activity of the detector on or in the cell, wherein a change 
in the localization, distribution, structure or activity of the detector indicates the 
presence of a toxin in the test substance, 

5 -providing a second array of locations containing cells to be treated with the test 

substance, wherem the cells possess a least a second luminescent reporter molecule 
comprising a reporter molecule selected from the group consisting of classifiers and 
identifiers, and wherem the second array of locations containing cells can comprise 
either the same or a different cell type as the first array of locations containing cells; 

10 -contacting the second array of locations containing cells with the test substance 

either before or after possession of the second luminescent reporter molecule by the 
cells; wherein the localization, distribution, structure, or activity of the second 
luminescent reporter molecule is modified when the cell is contacted with the toxin; 
-utilizing the digital data from the second luminescent reporter molecule to 

15 automatically measure the localization, distribution, or activity of the classifier or 

identifier on or in the cell, wherein a change in the localization, distribution, structure 
or activity of the classifier identifies a cell pathway that is perturbed by the toxin 
present in the test substance, or wherein a change in the localization, distribution, 
structure or activity of the identifief identifies the specific toxm or group of toxins that 

20 are present in the test substance. 
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Figure 1 
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Figure 2 
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Figure 3 
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Figure 5 
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Figure 6 
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Figure 7 
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Figure 9 
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Figure 13 
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Figure 15 
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Figure 25 
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Figure 27 
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Figure 28 
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1. SIGNAL SEQUENCES 



PCT/USOO/04794 



EPITOPE 


SEQUENCE 


SEQ ID NO: 


REFERENCE 


FLAG epitope 


5 ' GACTACAAAGACGACG 

TV 71 A w . TV TV /~I7V TV "A 

AA Seq: ACGACAAA 


35 
36 


Kasir, etal., 1999. JBioi 
Chem, 274:24873-80. 


HA epitope 


5 ' TACCCATACGACGTACCAGACTACGCA 
AA Seq: yPYDVPBYA 


37 
38 


Smitii, et al.. 1999. J Biol 
Chem. 274:19894-900. 


KT3 epitope 


5 ' CCACCAGAACCAGAAACA 
AA seq: PPEPET 


39 
40 


MacArttiur and Walter. 
1984. J Virol. 62:483-91. 


Myc epitope 


5 ' GCAGAAGAACAAAAATTAATAAGCGAAGA 
AGACTTA 

AA Seq: AEEQKLISEEDL 


41 
42 


Gosney, etat., 1990. 
Anticancer Res. 10:623-8. 











EYFP: SEQ ID NO: 43 (Nucleic acid); SEQ ID NO:44 (Amino acid) 

MVSK GEEL FTGV.VPIL VELD. 
ATGGTGAGCAAG GGCGAGGAGCTG TTCACCGGGGTG GTGCCCATCCTG GTCGAGCTGGAC 

GDVN GHKF SVSG EGEG DATY 
GGCGACGTAAAC GGCCACAAGTTC AGCGTGTCCGGC GAGGGCGAGGGC GATGCCACCTAC 

GKLT LKFI CTTG KLPV PWPT 
GGCAAGCTGACC CTGAAGTTCATC TGCACCACCGGC AAGCTGCCCGTG CCCTGGCCCACC 

LVTT F G Y ^ G LQCF ARY^P DHMK 
CTCGTGACCACC TTCGGCTACGGC CTGCAGTGCTTC GCCCGCTACCCC GACCACATGAAG 



QHDF FKS A MPEG YV Q E RTIF 
CAGCACGACTTC TTCAAGTCCGCC ATGCCCGAAGGC TACGTCCAGGAG CGCACCATCTTC 

FKDD GNYK TRAE VKFE GDTL 
TTCAAGGACGAC GGCAACTACAAG ACCCGCGCCGAG GTGAAGTTCGAG GGCGACACCCTG 

V N R I E ^ H 

GTGAACCGCATC GA <3GCAC 



K L E Y N N 
AAGCTGGAGTAC AACTACAACAGC CACAACGTCTAT ATCATGGCCGAC AAGCAGAAGAAC 

GIKV NFKI RHNI EDGS VQLA 
GGCATCAAGGTG AACTTCAAGATC CGCCACAACATC GAGGACGGCAGC GTGCAGCTCGCC 

DHYQ QNTP IGDG PVL L PDNH- 
GACCACTACCAG CAGAACACCCCC ATCGGCGACGGC CCCGTGCTGCTG CCCGACAACCAC 

YLSY QSAL SKDP NEKR DHMV 
TACCTGAGCTAC CAGTCCGCCCTG AGCAAAGACCCC AACGAGAAGCGC GATCACATGGTC 
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LLEF VTAA GITL GMDE LYK 
CTGCTGGAGTTC GTGACCGCCGCC GGGATCACTCTC GGCATGGACGAG CTGTACAAG 



EGFP: SEQIDNO:45 (Nucleic acid); SEQ ID NO:46 (Amino acid) 

MVSK GEEIi FTGV VPIL VELD 
ATGGTGAGCAAG GGCGAGGAGCTG TTCACCGGGGTG GTGCCCATCCTG GTCGAGCTGGAC 

GDVN GHKP SVSG EGEG DATY 
GGCGACGTAAAC GGCCAGAAGTTC AGCGTGTCCGGC GAGGGCGAGGGC GATGCCACCTAC 

GKLT LKFI CTTG KLPV PWPT 
GGCAAGCTGACC CTGAAGTTCATC TGCACCACCGGC AAGCTGCCCGTG CCCTGGCCCACC 

LVTT LTYG VQCF SRYP DHMK 
CTCGTGACCACC CTGACCTACGGC GTGCAGTGCTTC AGCCGCTACCCC GACCACATGAAG 

QHDF FKSA MPEG YVQE RTIF 
CAGCACGACTTC TTCAAGTCCGCC ATGCCCGAAGGC TACGTCCAGGAG CGCACCATCTTC 

FKDD GNYK TRAE VKFE GDTL 
TTCAAGGACGAC GGCAACTACAAG ACCCGCGCCGAG GTGAAGTTCGAG GGCGACACCCTG 

VNRI E LKG IDFK ED GN ILGH 
GTGAACCGCATC GAGCTGAAGGGC ATCGACTTCT^G GAGGACGGCAAC ATCCTGGGGCAC 

KLEY NYNS HNVY IMAD KQKN 
AAGCTGGAGTAC AACTACAACAGC CACAACGTCTAT ATCATGGCCGAC AAGCAGAAGAAC 

GIKV NFKI RHNI ED GS VQLA 
GGCATCAAGGTG AACTTCAAGATC C^CCACAACATC GAGGACGGCAG;: GTGCAGCTCGCC 

DHYQ QNTP IGDG PVLL PDNH 
GACCACTACCAG CAGAACACCCCC ATCGGCGACGGC CCCGTGCTGCTG CCCGACAACCAC 

YLST QSAL SKDP NE. KR DHMV 
TACCTGAGCACC CAGTCCGCCCTG AGCAAAGACCCC AACGAGAAGCGC GATCACATGGTG 

LLEF VTAA GITL GM DE LYK 
CTGCTGGAGTTC GTGACCGCCGCC GGGATCACTCTC GGCATGGACGAG CTGTACAAG 



EBPP : SEQ ID NO:47 (Nucleic acid); SEQ ID NO:48 (Amino acid) 

MVSK GEEL FTGV VPIL VELD 
ATGGTGAGCAAG GGCGAGGAGCTG TTCACCGGGGTG GTGCCCATCCTG GTCGAGCTGGAC 

GDVN GHKF SVS G EGEG DATY 
GGCGACGTAAAC GGCCAGAAGTTC AGCGTGTCCGGC GAGGGCGAGGGC GATGCCACCTAC 

GKLT LKFI CTTG KLPV PWPT 
GGCAAGCTGACC CTGAAGTTCATC TGCACCACCGGC AAGCTGCCCGTG CCCTGGCCCACC 
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LVTT LTHG VQCF SRY P DHMK 
CTCGTGACCACC CTGACCCACGGC GTGCAGTGCTTC AGCCGCTACCCC GACCACATGAAG 

QHDF FKS^A MPEG YVQE RTIF 
CAGCACGACTTC TTCAAGTCCGCC ATGCCCGAAGGC TACGTCCAGGAG CGCACCATCTTC 

FKDD GNYK TRAE VKFE GDTL 
TTCAAGGACGAC GGCAACTACAAG ACCCGCGCCGAG GTGAAGTTCGAG GGCGACACCCTG 

VNRI ELKG IDFKEDGN ILGH 
GTGAACCGCATC GAGCTGAAGGGC ATCGACTTCAAG GAGGACGGCAAC ATCCTGGGGCAC 

KIiEY NFNS HNVY IMAD KQKN 
AAGCTGGAGTAC AACTTCAACAGC CACAACGTCTAT ATCATGGCCGAC AAGCAGAAGAAC 

GIKV NFKI RHN I EDGS VQliA 
GGCATCAAGGTG AACTTCAAGATC CGCCACAACATC GAGGACGGCAGC GTGCAGCTCGCC 

DHYQ QNTP IGDG PVLL PDNH 
GACCACTACCAG CAGAACACCCCC ATCGGCGACGGC CCCGTGCTGCTG CCCGACAACCAC 

YXjST QSAL SKDP NEKR DHMV 
TACCTGAGCACC CAGTCCGCCCTG AGCAAAGACCCC AACGAGAAGCGC GATCACATGGTC 

LliEF VTA A GITL GM'DE LYK 
CTGCTGGAGTTC GTGACCGCCGCC GGGATCACTCTC GGCATGGACGAG CTGTACAAG 



ECFP: SEQIDNO:49 (Nucleic acid); SEQ ID NO:50 (Amino acid) 

MVSK GEEIj FTGV VP IIi VELD 
ATGGTGAGCAAG GGCGAGGAGCTG TTCACCGGGGTG GTGCCCATCCTG GTCGAGCTGGAC 

GDVN.GHKF SVSG EGEG DATY 
GGCGACGTAAAC GGCCACAAGTTC AGCGTGTCCGGC GAGGGCGAGGGC GATGCCACCTAC 

GK LT LKFI CTTG KIi-PV PWPT 
GGCAAGCTGACC CTGAAGTTCATC TGCACCACCGGC AAGCTGCCCGTG CCCTGGCCCACC 

LVTT LTWG VQCF SR YP DHMK 
CTCGTGACCACC CTGACCTGGGGC GTGCAGTGCTTC AGCCGCTACCCC GACCACATG7VAG 

QHDF FKSA MPEG YVQE RTIF 
CAGCACGACTTC TTCAAGTCCGCC ATGCCCGAAGGC TACGTCCAGGAG CGCACCATCTTC 

FK DD G N Y K TRAE VKFE GDTL 
TTCAAGGACGAC GGCAACTACAAG ACCCGCGCCGAG GTGAAGTTCGAG GGCGACACCCTG 

VNRI ELKG IDFK EDGN ILG H 
GTGAACCGCATC GAGCTGAAGGGC ATCGACTTCAAG GAGGACGGCAAC ATCCTGGGGCAC 

KLEY NYIS H NVY ITAD KQKN 
AAGCTGGAGTAC AACTACATCAGC CACAACGTCTAT ATCACCGCCGAC AAGCAGAAGAAC 
GIKA NFKI RHNI EDGS VQLA 
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GGCATCAAGGCC AACTTCAAGATC CGCCACAACATC GAGGACGGCAGC GTGCAGCTCGCC 

DHYQ QNTP IGDG PVLL PDNH 
GACCACTACCAG CAGAACACCCCC ATCGGCGACGGC CCCGTGCTGCTG CCCGACAACCAC 

YLST QSAL SKDP N EKR DHMV 
TACCTGAGCACC CAGTCCGCCCTG AGCAAAGACCCC AACGAGAAGCGC GATCACATGGTC 

LLEF VTAA GITL GMDE LYK 
CTGCTGGAGTTC GTGACCGCCGCC GGGATCACTCTC GGCATGGACGAG CTGTACAAG 



Fred25: SEQ ID NO:51 (Nucleic acid); SEQ ID NO: 52 (Amino acid) 

MASK GEEL FTGV VPIL VELD 
ATGGCTAGCAAA GGAGAAGAACTC TTCACTGGAGTT GTCCCAATTCTT GTTGAATTAGAT 

G D V N GHKF SVSG EGEG DATY 
GGTGATGTTAAC GGCCACAAGTTC TCTGTCAGTGGA GAGGGTGAAGGT GATGCAACATAC 

G KLT LKFI CTTG KLPV PWPT. 
GGAAAACTTACC CTGAAGTTCATC TGCACTACTGGC AAACTGCCTGTT CCATGGCCAACA 

liVTT LCYG VQCF SRY P DHMK 
CTAGTCACTACT CTGTGCTATGGT GTTCAATGCTTT TCAAGATACCCG GATCATATGAAA 

RHDF FKSA MPEG Y V QE RTIF 
CGGCATGACTTT TTCAAGAGTGCC ATGCCCGAAGGT TATGTACAGGAA AGGACCATCTTC 

FKDD GNYK TRAE VKFE GDTL 
TTCAAAGATGAC GGCAACTACAAG ACACGTGCTGAA GTCAAGTTTGAA GGTGATACCCTT 

VNRI ELKG IDFk'^EDGN ILGH 
GTTAATAGAATC GAGTTAAAAGGT ATTGACTTCAAG GAAGATGGCAAC ATTCTGGGACAC 

KLEY NYN S HNV Y IMAD KQKN 
AAATTGGAATAC AACTATAACTCA CACAATGTATAC ATCATGGCAGAC AAACAAAAGAAT 

GIKV NFKT RHNI EDGS VQLA 
GGAATCAAAGTG AACTTCAAGACC CGCCACAACATT GAAGATGGAAGC GTTCAACTAGCA 

DHYQ QNTP IGDG PVL L PDNH 
GACCATTATCAA CAAAATACTCCA ATTGGCGATGGC CGTGTCCTTTTA CCAGACAAGCAT 

YLST QSAL SKDP NEKR D HMV 
TACCTGTCCACA CAATCTGCCCTT TCGAAAGATCCC AACGAAAAGAGA GACCACATGGTC 

LLEF VTAA GITH GMDE LYN* 
CTTCTTGAGTTT GTAACAGCTGCT GGGATTACACAT GGCATGGATGAA CTGTACAACTAG 
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Substrate 

Recognitions 

Sequences 


Source 


Recognition Site 


SEQID 
NO 


Reference 


Caspase-l,4,S 


peptide library 


5 *(TQG,TTA)GAACATGACAA 
Seq:(WX)EHD/ 


53 
54 


Thombeny et al., 1997, J. Biol. 
Chem. 272:17907 


proCaspase-l 


peptide library 


5'TQGTTTAAAGAC 
AASeq:WFK0/ 


55 
56 


Thomberry et al., 1997, J. Biol. 
Chem. 272:17907 


Ca8pase-2 


peptide libraiy 


5»GACGAACACGAC 
AA Seq: DEHD/ 


57 
58 


Thombeny et ah, 1997, J. Biol. 
Chem. 272:17907 


Caspase 3, 7 


PARP 


5'GACGAAGTTGAC 
AA Seq: DEVD/ 


59 
60 


Beneke, et al.. 1997. Biochem 
Mol Biol Int 43:755-61; 
Thombeny et ai., 1997, J. Biol. 
Chem. 272:17907 


ProCaspase 3 


Caspase-3 


5*ATAGAAACAGAC 
AA Seq: lETD/ 


61 
62 


Tewari,M„etal., 1995. Cell. 
81:801-9. 


ProCaspase-4,5 


peptide library 


5TGGGTAAGAGAC 
AA Seq: WVRD/ 


63 
64 


Thombeny, N. A. etal., 1997, 
J.Biol. Chem. 272, 17907-1791 1 


Caspase 6 


Lamin A, 
peptide library 


5'GTAQAAATAGAC 
AA Seq; VEID/ 
5'GTAGAACACGAC 
AASea: VEHD/ 


65 
66 
67 
68 


Nakajimaand Sado. 1993. 
Biochim Biophys Acta. 1 171 :3 1 1- 
4; Thombeny et al., 1997, J. Bio!. 
Chem. 272:17907 


proCaspase 6 


Ca5pase-6 


5'ACAGAAGTAGAC 
AASeq:TEVD/ 


69 
70 


Femandes-Alnemri, et al., 1994. J 
Biol Chem. 269:30761-4. 


proCaspase-7 


peptide library 


5'ATACAAGCAGAC 
AA Seq: IQAD/ 


71 
72 


Thombeny, N.A. et al., 1997, 
J.Biol. Chem. 272, 17907-17911 


Caspase S • 


peptide library 


5'GTAGAAACAGAC 
AASeq: VETD/, 


73 
74 


Muzio, M.,etal.. 1996. Cell. 
85:817-27; Femandes-Alnemri, et 
al., 1996. Proc Natl Acad Sci U S 
A. 93:7464-9;Thomberry et al., 
1997, L Biol. Chem. 272:17907 


pFoCaspase-8 


Caspase-8 


5'TTAGAAACAGAC 
AA Seq: LETD/ 


75 
76 


Muzio, M., et al.. 1996, Cell. 
85:817-27; Femandes-Alnemri, et 
al., 1996. Proc Natl Acad Sci U S 
A. 93:7464-9;Thomberry et al., 
1997. J. Biol. Chem. 272:17907 


Caspase 9 


peptide library 


5TTAGAACACGAC 
AA Seq: LEHD/ 


77 
78 


Thomberry, N A. etal., 1997, 
J,Biol. Chem. 272, 17907-17911 


proCaspase 9 


Caspase-9 


5'TTAGAACACGAC 
AA Seq: LEHD/ 


79 
80 


Thombeny, N.A. etal., 1997, 
J.Biol. aiem, 272, 17907-17911 


HIV protease 




5'AGCCAAAATTAC 
AA Seq: SQNY/ 

5'CCAATAGTACAA 
AASeq: PIVQ/ 


81 
82 

83 
84 


Matayoshi, et al., 1990, Science. 
247:954-8. 


Adenovirus 
endopeptidase 




5'AUGTTTGGAGGA 
AASeq:MFGG/ 

5'GCAAAAAAAAGA 
AA Seq: AKKR/ 


85 
86 

87 
88 


Weber and Tihanyi. 1 994. 
Methods Enzymol. 244:595-604, 


b-Secretase 


Amyloid 
precursor 
protein 


5'GTAAAAAUG 
AA Seq: VKM/ 

5'GACGCAGAATTC 
DAEF/ 


89 
90 

91 
92 


Hardy et al„ 1994, in Amyloid 
Protein Precursor in 
Development, Aging, and 
Alzheimer's Disease, ed. C.L. 
Masters et al.. pp. 190-198. 


Cathepsin D 




5 * AAACCAGCATTATTC 
AA Seq: KPALF 

5'TTCAGATTA 
AA Seq: FRJLJ 


93 
94 

95 
96 


Durm, et al., 199S. Adv Exp Med 
Biol. 436:133-8. 


Matrix 

Metalloproteases 




5 •GGACCATTAGGACCA 
AA Seq: GPLGP 


97 
98 


Bouvier et al., 1993; Garbett et 
al., 1999; Hill and Sakanari, 1997; 
Kojima et al., 1998; Tyagi et al., 
1995; Wilhelmet al.. 1993; 
Williams andAuld, 1986; 
Haugland, R., Handbook of 
fluorescent probes and research 
Chemicals 7th ed. 


Granzyme B 


peptide library 


5 ' ATAGAACCAGAC 
AA Seq: lEPD/ 


99 
100 


Thombeny etal., 1997, J. Biol. 
Chem. 272:17907 
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Anthrax protease 


MEKl 


5 • ATGCCCAAGAAG AAGCCQ AC 
GCCCATCCAGCTGAACCC 

AA Seq: MPKKKPTPIOLN 


101 
102 


.Vitaleetal.,(1998) Biochem 
Biophys Res Commun 248 (3), 
706-711 


Anthrax nrntca!;e 


MEK2 


5 ' ATGCTGGCCCGG AGO AAGCCG 

GTGCTGCCGGCGCTCACCATCA 

ACCC 

AA Seq: MLARRKPVLPALTIN 


103 
104 


Vitale et al., (1998) Biochem 
Biophys Res Commun 248 (3), 
706-711 


tnfnniic/hnhiliniiTTi 


cellubrevin 


5 »GCCTCGCAGTTTGAAACA 
AA Seq: ASOFET 


105 
106 


McMahon et al., Nature 364:346- 
349; Martin et a!.. J. Cell Biol. In 
press 


tetanus/botulinum 


synaptobrevin/ 
VAMP3 


5*GCTTCTCAATTTGAAACG 
AA Seq: ASOFET 


107 
108 


Schiavo et al., (1992) Nature 
359. 832-5 


Sotuiinum 
neurotoxin A 


SNAP-25 


5'GCCAACCAACGTGCAACA 
AA Seq: ANO/T(AT 


109 
110 


Zhao, et al. Gene 145 (2). 313- 
314(1994) 


Rnfitlinum 

neurotoxin B 


VAMP 


5 'GCTTCTCAATTTGAAACG 
AA Seq: ASO/FET 


111 
112 




"Rrktiiliniim 

neurotoxin C 


Syntaxin 


5 ' ACGAAAAAAGCTGTGAAA 
AA Seq: TKK/AVK 


113 
114 


Martin et al., J. Leukoc. Biol. 65 
(3\ 397-406 fl999> 


Botulinum 

neurotoxin D 


VAMP 


5 *GACCAGAAGCTCTCTGAG 
AA Seq: DOK/LSE 


115 
116 




Botulinum 

neurotoxin E 


SNAP-25 


5 • ATCG ACAGGATCATGG AG 
AASeq!lDR/IME 


117 
118 




Botulinum 

neurotoxin F 


VAMP 


5'AGAGACCAGAAGCTCTCT 
AA Seq: RDO/KLS 


119 
120 




Botulinum 
neurotoxin G 


VAMP 


5'ACGAGCGCAGCCAAGTTG 
AASeq:TSA/AKL 


121 
122 
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FIGURE 29C 

3. PRODUCT/REACTANT TARGET SEQUENCES 



Target 


Target Source 


Target domain (Product or Reactant) 


SEQ ID 
NO 


Reference 


Cytoplasm/cytos 
keleton 


Aimexin H 


5 • ATOTCTACTGTCCACGAAATCCTGTGCAAG 
CTCAGCTTGGAGGGTGTTCATTCTACACCCCC 
AAGTGCC3' 

(Amino acidseq: MSTVHEILCKLSL 
EGVHSTPPSA) 


123 
124 


Eberhard, et al., 
1997. MoI.Biol. 
CeUS:293a, 


Inner surface of 

plasma 

membrane 


&mesylation 


5 ' AUGGGATCTACATTAAGCGCAGAAG ACAA 
AGCAQCAGTAGAAAGAAGCAAAAUGATAGA 
CAGAAACTTATTAAGAGAAGACGGAGAAAA 
AGCTGCTAGA3' 

(AAscq:MaCTLSAEDKAAVER 
SKMIDRNLREDGEKAAR 


125 
126 


Femiccio G, et aL, 
J. Biol. Chem. 274, 
5843-5850. 1999 


Nucleus 


NFkB p50 


5'AGAAGGAAACX3ACAAAAG 
(AA seq: R R K R Q K) 


127 
128 


Henkel,T etal.. 
Cell 68.1121- 
1133, 1992 


Nucleolus 


NOLP 


5'AGAAAACGTATACGTACTTACCTCAAGTCC 
TGCAGGCGGATGAAAAGAAGTX3GTTTTGAGA 
TGTCTCX}ACCTATTCCTTCCCACCrrTACT 

(AAseq:R KRIRTYLKSCRRMK 
RSGFEMSRPIPSHLT) 


129 
130 


Ueki, etal., 1998. 
Biochem Biophys 
ResComznun, 
252:97-102. 


Mitodiondria 


cyto chrome c 
oxidase 


5 * ATGTCCGTCCTGACGCCGCTGCTGCTGCGG 
GGCTTGACAGGCTCGGCCCGQCGQCTCCCAG 
TGCCGCGCGCCAAGATCCATTCGTTG 

(AASeqiMSVLTPLLLRGLTGS 
ARRLPVPRALIHSU 


131 
132 


Rizzuto. et al.. 
1989. J Biol Chem. 
264:10595-600. 


Nuclear Envelope 


ODV-E66& 
ODV-E25 


5 * AUQAGCATTGTTTTAATAATTGTTATTTOOA 
rrn'ri'i'AATATGTi-i'rr'lATAnTAAGCAACA 
GCAAAGATCCCAGAGTACCAGTTGAATTAAU 
G 

(AA Seq: MSIVLIIVIVVIFLICF 
LYLSNSKDPRVPVELM) 


133 
134 


Hong, T, et aL 
PNAS, 94. 4050- 
4055, 1997 


Golgi 


Calreticulm 


5'ATOAGGCTTCGGGAGCCGCTCCTGAGCGGC 

AGCX3CCGCGATGCCAGGCGCGTCCCTACAGC 

GGGCCTGCCGCCTGCTCGTGGCCGTCTGCGCT 

CTGCACCTTGGCGTCACCCTCQTTTACTACCT 

GGCTGGCCGCGACXrrGAGCCGCCTGCCCCAA 

CTGGTCOGAGTCTCCACACCGCTGCAGGQCG 

GCTCGAACAGTaCCGCCaCCATCGGGCAGTC 

CTCCGGGQAGCTCCGGACCGOAGQGGCC 

(AASeqiMRLREPLLS GSAAMP 

VYYLAGRDLSRLPQLVGVSTPLQG 
GSNSAAAIGOSSGELRTGGA) 


135 
136 


Fliegel, L., et al., J. 
Biol. Chem. 264, 
21522-21528, 
1989. 


&idoplasmic 
reticulutn 


D-AKAPl 


5 'GAAACAATAAGACCTATAAGAAGATGTAGT 

TTCAATTAAGATCrCCCTrrCCATTAGCATTA 
CCAGOAAUGTTAGCTTTATTAGGATGGTGGr 
GGTmTCAGTAGAAAAAAA 

(AASeqiETIRPIRIRRCS YFTSTDSKM 

AIQLRSPFPLALPGMLALLGWWW 

FFSRKK 


137 
138 


Huan& U. Et al., 
J. Cell. Biol. 145, 
951-959. 1999 


Nuclear Export 


MEK1 


5 ' GCCTTGCAGAAGAAGCTGGAGGAGCT 
A£3AGCTTGATGAG 

(AA SEQ: A LQKKLEELE 
L D E 


139 
140 


Fukuda. (1997) 
J. Blot. Chem 
272, 51, 32B42- 
32648 


Size exclusion 


PROJ domain of 
MAP4 


S'GCCGACCrCAGTCTTGTGGATGCGTTGACA 


141 


West, (1991). J 
Biol Chem 
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AGCGAGACTTCATGGCTGCGCTGGAGGCAGA 

GCCCTATOATGACATCQTGGQAGAAACTGTG 

GAGAAAACTGAGTTTATTCCTCrCCTGGATGG 

TGATGAGAAAACCQGaAACTCAGAGTCCAAA 

AAOAAACCCTGCTTAGACACTAGCCAGGTTa 

AAGGTATCCCATCTTCTAAACCAACACTCCTA 

GCCAATGGTGATCATGGAATGGAGGGGAATA 

ACACTGCAGGGTCTCCAACTGACTTCCTTGAA 

GAGAGAGTGGACTATCCGGATTATCAGAGCA 

GCCAGAACTGGCCAGAAGATGCAAGCTTTTG 

TTTCCAGCCTCAGCAAGTGTTAGATACTGACC 

AGGCTOAGCCCTTTAACGAGCACCGTGATGA 

TGGTTTGGCAGATCTOCTCTTTGTCTCCAGTG 

GACCCACGAACGCTTCTGCATTTACAGAGCG 

AGACAATCCTTCAGAAGACAGTTACGGTATQ 

CnrCCCTGTGACTCATTTGCTTCCACGGCTGT 

TGTATCTCAGGAGTGGTCTGTGGGAGCCCCA 

AACTCTCCATGTTCAGAGTCCTGTGTCTCCCC 

AGAOGTTACTATAGAAACCCTACAGCCAGCA 

ACAGAGCTCTCCAAGGCAGCAGAAGTGGAAT 

CAGTGAAAQAGCAGCTGCCAGCTAAAGCATT 

GGAAACGATGGCAGAGCAGACCACTGATGTG 

GTGCACTCTCCATCCACAGACACAACACCAG 

GCCCAGACACAGAGGCAGCACTGGCTAAAGA 

CATAGAAGAGATCACCAAGCCAGATGTGATA 

TTGGCAAATGTCACQCAGCCATCTACTGAAT 

CGGATATGTTCCTGGCCCAGGACATGGAACT 

ACTCACAGGAACAGAGGCAGCCCACGCTAAC 

AATATCATATTGCCTACAGAACCAGACGAAT 

CTTCAACCAAGGATGTAGCACCACCTATGGA 

AGAAGAAATTGTCCCAGQCAATGATA 

(AASEQ: ADLSLVDALTEPPPEIEGBI 
KRDFMAALEAEP YDDIVGETVEKT 
EFIPLLDGDEKTGNSESKKKPCLD 
TSQ VEGIPSSKPTLLANGDH GMEG 
NNTAGSPTDFLEERVDYPDYQSS 
QNWPED ASFCFQPQQVLDTDQAE 
PFNEHRDDGLADLLFVSSGPTNAS 
AFTERDNPSEDSYGMLPCDSFAST 
AVVSQEWS VGAPNSPCSESC VSP 
EVTIETLQP ATELSKAAEVES VKE Q 
LPAKALETMAEQTTDVVHSPSTDT 
TPG P DTE AALAKDIEEITKPD VILA 
NVTQPSTESDMFL AQDMELLTGTE 
AAHANNIILPTEPDBSSTKDVAPPM 
EEEIVPGNDTTSPKETETTLPIKMD 
LAPPEDVLLTKETELAPAKGMVSL 
SEIEEALAKND VRSAEIPVAQETV 
VSETEVVLATE VVLPSDPITTLTK 
DVTLPLEAERPLVTDMTPSLETEM 
TLGKETAPPTETNLGMAKDMSPLP 
ESEVTLGKDVVILPETKVAEFNNV 
TPLSEEEVTSVKDMSPSAETEAPL 
AKNADLHSGTELI VDNSMAP ASDL 
ALPLETKVATVPIKDKG 


142 ■ 


266(32): 21886- 
96; Olson, K. R. 
(1995). J Cell 
Biol130(3): 639- 
50. 












Vesicle 
membrane 


Synaptobrevin 


5 ' ATGTGGGCAATCGGGATTACTGTTCT 
GGTTATCTTCATCATCATCA.TCATCGTG 
TGGGTTGTC 

{AA SBQ: MWAIGITVLV 
IFIIIIIVWVV) 


143 
144 


Schiavo et aL, 
(1992) Nature 
359. 832-5 


Vesicle 
membrane 


Celiubrevin 


5 ' ATGTGGGCGATAGGGATCAGTGTCCT 
GGTGATCATTGTCATCATCATCATCGTG 
TGGTGTG 

(AA SEQ: MWAIGISVLV 
IIVIIIIVWC) 


145 
146 


McMahonet al.. 
Nature 364:346- 
349; Martin et al., 
J. Cell Biol. In 
press 


Nuclear Export 


MEK2 


5 ' GACCTGCAGAAGAAGCTGGAGGAGCT 
GGAACTTGACGAG 

AA SEQ: D L Q KK L E E L E L D E 


147 
148 


Zheng and Guan, 
J. Biol. Chem. 
268:11435-11439, 
1993 
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Peroxisome 


PX 


5 ' TCTAAACTG 
AA SEQ: S K L 


149 
150 


Amery et al., 
Biochem. J. 
335'367-371 
fl998) 













Microtubules CViAP4) SEQIDNO:151 (Nucleic acid); SEQ ED NO: 152 (amino acid) 



MAP4: 

MADL SLVD ALTE PPPB lEGE 
ATGGCCGACCTC AGTCTTGTGGAT GCGTTGACAGAA CCACCTCCAGAA ATTGAGGGAGAA 
TACCGGCTGGAG TCAGAACACCTA CGCAACTGTCTT GGTGGAGGTCTT TAACTCCCTCTT 

IKRD FMAA LEAE PYDD IVGE 
ATAAAGCGAGAC TTCATGGCTGCG CTGGAGGCAGAG CCCTATGATGAC ATCGTGGGAGAA 
TATTTCGCTCTG AAGTACCGACGC GACCTCCX3TCTC GGGATACTACTG TAGCACCCTCTT 

TVEK TEF I PIiLD GDEK TGNS 
ACTGTGGAGAAA ACTGAGTTTATT CCTCTCCTGGAT GGTGATGAGAAA ACCGGGAACTCA 
TGACACCTCTTT TGACTCAAATAA GGAGAGGACCTA CCACTACTCTTT TGGCCCTTGAGT 

ESKK KPCIi DTS Q VEGI PS SK 
GAGTCCAAAAAG AAACCCTGCTTA GACACTAGCCAG GTTGAAGGTATC CCATCTTCTAAA 
CTCAGGTTTTTC TTTGGGACGAAT CTGTGATCGGTC CAACTTCCATAG GGTAGAAGATTT 

PTLL ANGD HGME GNNT AGSP 
CCAACACTCCTA GCCAATGGTGAT CATGGAATGGAG GGGAATAACACT GCAGGGTCTCCA 
GGTTGTGAGGAT CGGTTACCACTA GTACCTTACCTC CCCTTATTGTGA CGTCCCAGAGGT 

TDFL EERV DYPD YQSS QNWP 
ACTGACTTCCTT GAAGAGAGAGTG GACTATCCGGAT TATCAGAGCAGC CAGAACTGGCCA 
TGACTGAAGGAA CTTCTCTCTCAC CTGATAGGCCTA ATAGTCTCGTCG GTCTTGACCGGT 

EDAS FCFQ PQQV LDTD QA^P 
GAAGATGCAAGC TTTTGTTTCCAG CCTCAGCAAGTG TTAGATACTGAC CAGGCTGAGCCC 
CTTCTACGTTCG AAAACAAAGGTC GGAGTCGTTCAG T^TCTATGACTG GTCCGACTCGGG 

FMBH RDDG IjA DL LFVS SG PT 
TTTAACGAGCAC CGTGATGATGGT TTGGCAGATCTG CTCTTTGTCTCC AGTGGACCCACG 
AAATTGCTCGTG GCACTACTACCA AACCGTCTAQAC GAGAAAGAQAGG TCACCTGGGTGC 

NASA FTER DNPS ED SY GMLP 
AACGCTTCTGCA TTTACAGAGCGA GACAATCCTTCA GAAGACAGTTAC GGrATGCTTCCC 
TTGCGAAGACGT AAATGTCTCGCT CTGTTAGGAAGT CTTCTGTCAATG CCATACGAAGGG 



CDSF ASTA VVSQ BWSV GAPN 
TGTGACTCATTT GCTTCCACGGCT GTTGTATCTCAG GAGTGGTCTGTG GGAGCCCCAAAC 
ACACTGAGTAAA CGAAGGTGCCGA CAACATAGAGTC CTCACCAGACAC CCTCGGGGTTTG 

SPCS ESCV SPEV TIET LQPA 
TCTCCATGTTCA GAGTCCTGTGTC TCCCCAGAGGTT ACTATAGAAACC CTACAGCCAGCA 
AGAGGTACAAGT CTCAGGACACAG AGGGGTCTCCAA TGATATCTTTGG GATGTCGGTCGT 

TELS KAAE VESV KEQL PAKA 
ACAGAGCTCTCC AAGGCAGCAGAA GTGGAATCAGTG AAAGAGCAGCTG CCAGCTAAAGCA 
TGTCTCGAGAGG TTCCGTCGTCTT CACCTTAGTCAC TTTCTCGTCGAC GGTCGATTTCGT 
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LETM AEQT TDVV HSPS TDTT 
rXGGAAACGATG GCAGAGCAGACC ACTGATGTGGTG CACTCTCCATCC ACAGACACAACA 
AACCTTTGCTAC CGTCTCGTCTGG TGACTACACCAC GTGAGAGGTAGG TGTCTGTGTTGT 

PGPD TBAA LAKD lEEl TKPD 
CCAGGCCCAGAC ACAGAGGCAGCA CTGGCTAAAGAC ATAGAAGAGATC ACCAAGCCAGAT 
GGTCCGGGTCTG TGTCTCCGTCGT GACCGATTTCTG TATCTTCTCTAG TGGTTCGGTCTA 

VILA NVTQ PSTE SDMF IiAQD 
GTGATATTGGCA AATGTCACGCAG CCATCTACTGAA TCGGATATGTTC CTGGCCCAGGAC 
CACTATAACCGT TTACAGTGCGTC GGTAGATGACTT AGCCTATACAAG GACCGGGTCCTG 

MELL TGTE AAHA NNII LPTE 
ATGGAACTACTC ACAGGAACAGAG GCAGCCCACGCT AACAATATCATA TTGCCTACAGAA 
TACCTTGATGAG TGTCCTTGTCTC ' CGTCGGGTGCGA TTGTTATAGTAT AACGGATGTCTT 

PDES STKD VAPP M EEE IVPG 
CCAGACGAATCT TCAACCAAGGAT GTAGCACCACCT ATGGAAGAAGAA ATTGTCCCAGGC 
GGTCTGCTTAGA AGTTGGTTCCTA CATCGTGGTGGA TACCTTCTTCTT TAACAGGGTCCG 

NDTT SPKB TETT IiPIK MDLA 
AATGATACGACA TCCCCCAAAGAA ACAGAGACAACA CTTCCAATAAAA ATGGACTTGGCA 
TTACTATGCTGT AGGGGGTTTCTT TGTCTCTGTTGT GAAGGTTATTTT TACCTGAACCGT 

PPED VIi-L T KETE LAPA KGMV 
CCACCTGAGGAT GTGTTACTTACC T^GAAACAGAA CTAGCCCGAGCC AAGGGCATGGTT 
GGTGGACTCCTA CACAATGAATGG TTTCTTTGTCTT GATCGGGGTCGG TTCCCGTACCAA 

SLSE lEEA LAKN DVRS AEIP 
TCACTCTCAGAA ATAGAAGAGGCT CTGGCAAAGAAT GATGTTCGCTCT GCAGAAATACCT 
AGTGAGAGTCTT TATCTTCTCCGA GACCGTTTCTTA CTACAAGCGAGA CGTCTTTATGGA 

VAQE TVVS ETBV VLAT EVVIi 
GTGGCTCAGGAG ACAGTGGTCTCA GAAACAGAGGTG GTCCTGGCAACA GAAGTGGTACTG 
CACCGAGTCCTC TGTCACCAGAGT CTTTGTCTCCAC CAGGACCGTTGT CTTCACCATGAC 

PSDP ITTIi TKDV TLPL EAKR 
CCCTCAGATCCC ATAACAACATTG ACAAAGGATGTG ACACTCCCCTTA GAAGCAGAGAGA 
GGGAGTCTAGGG TATTGTTGTAAC TGTTTCCTACAC TGTGAGGGGAAT CTTCGTCTCTCT 



PLVT DMTP SLBT EMTL GKET 
CCGTTGGTGACG GACATGACTCCA TCTCTGGAAACA GAAATGACCCTA GGCAAAGAGACA 
GGCAACCACTGC CTGTACTGAGGT AGAGACCTTTGT CTTTACTGGGAT CCGTTTCTCTGT 

APPT ETNIi GMAK DMSP LPES 
GCTCCACCCACA GAAAGAAATTTG GGCATGGCCAAA GACATGTCTCCA CTCCCAGAATCA 
CGAGGTGGGTGT CTTTGTTTAi^C. CCGTACC6GTTT CTGTACAGAGGT GAGGGTCTTAGT 

E VTL GK DV VILP ETK V AEF N 
GAAGTGACTCTG GGCAAGGACGTG GTTATACTTCCA GAAACAAAGGTG GCTGAGTTTAAC 
CTTCACTGAGAC CCGTTCCTGCAC CAATATGAAGGT CTTTGTTTCCAC CGACTCAAATTG 

KVTP liSEE EV T S VKDM SPSA 
AATGTGACTCCA CTTTCAGAAGAA GAGGTAACCXCA GTCAAGGACATG TCTCCGTCTGCA 
TTACACTGAGGT GAAAGTCTTCTT CTCCATTGGAGT CAGTTCCTGTAC AGAGGCAGACGT 
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Figure 31 
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Figure 32 




BHK cells transfected with DEVD-caspase 
biosensor. (A) Cells before stimulation of apoptosis. 
(B) Another field of cells alter stimulation \with 
250 ng/ml cis-platin (4 h). 
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Figure 33 
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Figure 34 
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Figure 35 
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Figure 36 
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SEQUENCE LISTING 

. <110> Giuliano, Kenneth A. 
Kapur , Ravi 

<120> A System for Cell Based Screening 

<130> 97-022-L 

<140> To Be Assigned 
<141> Filed Herewith 

<160> 180 

<170> Patent In Ver. 2.0 

<210> 1 
<211> 1770 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . , (B82) 

<220> 

<223> Description of Artificial Sequence: 
GFP-DEVD-Annexin II construct 

<400> 1 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 



48 



gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gcc acc tae ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc acc ace ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc ace 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 S5 60 

ctg acc tae ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



cag cac gac ttc ttc aag tec gee atg ccc gaa ggc tac gtc cag gag 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



288 



cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gee gag 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac ace ctg gtg aac cgc ate gag ctg aag ggc 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
lis 120 125 . 
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ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
lie Asp Phe Iiys Glu Asp Gly Asn He Leu Gly His liys Leu Glu Tyr 
130 135 140 



aac tac aac age cac aac gtc tat ate atg gee gac aag cag aag aac 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 
Gly He Lye Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 



480 



528 



gtg cag etc gee gac cac tac cag cag aac ace ccc ate ggc gac ggc 57 6 
val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 1S5 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age ace cag tec gee ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tlir Gin Ser Ala Leu 

195 200 205 J 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg ace gee gee ggg ate act etc ggc atg gac gag ctg tac aag tec 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

gga etc aga tct ggc gee ggc get gga gee gga get ggc gee gga gee 768 
Gly Leu Arg Ser Gly Ala Gly Ala Gly Ala Gly Ala Gly Ala Gly Ala 
245 250 255 

gac gag gtg gac ggc gee ggc gcc gat gaa gta gat ggc gcc atg tct 816 
Asp Glu Val Asp Gly Ala Gly Ala Asp Glu Val Asp Gly Ala Met Ser 
260 265 270 

act gtc cac gaa ate ctg tgc aag etc age ttg gag ggt gat cat tct 864 
Thr Val His Glu He Leu Cys Lys Leu Ser Leu Glu Gly Asp His Ser 
275 280 285 

aca ccc cea agt gcc tat tgaatggtga gcaagggcga ggagctgttc 912 
Thr Pro Pro Ser Ala Tyr 
290 



accggggtgg 


tgeccatect 


ggtcgagctg 


gacggcgacg 


taaacggcca 


caagttcagc 


972 


gtgtecggcg 


agggcgaggg 


cgatgecace 


tacggcaage 


tgaccctgaa 


gttcatctgc 


1032 


accaccggca 


agetgcccgt 


gccctggeee 


accctcgtga 


ccaccctgae 


etacggcgtg 


1092 


cagtgcttca 


gccgetacce 


cgaccacatg 


aagcagcaeg 


acttctteaa 


gtccgccatg 


1152 


cecgaaggct 


acgtecagga 


gcgcaecatc 


ttetteaagg 


acgaeggcaa 


ctacaagacc 


1212 


cgegecgagg 


tgaagttcga 


gggegaeace 


ctggtgaace 


gcategagct 


gaagggcatc 


1272 


gaettcaagg 


aggaeggcaa 


catectgggg 


cacaagetgg 


agtaeaacta 


caacaofccac 


1332 


aacgtctata 


tcatggccga 


eaagcagaag 


aacggcatca 


aggtgaactt 


caagatecgc 


1392 
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cacaacatcg aggacggcag cgtgcagctc gccgaccact accagcagaa cacccccatc 14S2 
ggcgacggcc ccgtgctgct gcccgacaac cactacctga gcacccagtc cgccctgagc 1512 
aaagacccca acgagaagcg cgatcacatg gtcctgctgg agttcgtgac cgccgccggg 1572 
atcactctcg gcatggacga gctgtacaag tccggactca gatctggcgc cggcgctgga 1632 
gccggagctg gcgccggagc cgacgaggtg gacggcgccg gcgccgatga agtagatggc 1632 
gccatgtcta ctgtccacga aatcctgtgc aagctcagct tggagggtga tcattctaca 1752 
cccccaagtg cctattga 1770 

<210> 2 
<211> 294 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
GFP-DEVD-Annexin II construct 

<400> 2 

Met Val Ser Lys Gly Glu Glu Leu ?he Thr Gly Val Val Pro lie Leu 
1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Thr Thr Gly Lys^ Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 ■ 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp .Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 
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Pro Val lisu Leu Pro Asp Asn His Tyr Leu Ser Tlir Gin Ser iVla Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Gly Ala Gly Ala Gly Ala Gly Ala Gly Ala Gly Ala 
245 250 255 

Asp Glu Val Asp Gly Ala Gly Ala Asp Glu Val Asp Gly Ala Met Ser 
260 265 270 

Thr Val His Glu He Leu Cys Lys Leu Ser Leu Glu Gly Asp His Ser 
275 280 285 

Thr Pro Pro Ser Ala Tyr 
290 



<210> 3 
<211> 2439 
<212> DNA 

<213> J^tificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (2436) 

<220> 

<223> Description of Artificial Sequence: 
EYFP-DEVD-MAPKDM construct 

<400> 3 ^ 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 



ttc ggc tac ggc ctg cag tgc ttc gcc cgc tac ccc gac cac atg aag 
Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 



48 



gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 . 

tgc ace ace ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



240 



288 



336 
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Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lye Gly 
115 . 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly lie Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser. Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 

Gly Arg Arg Lys Arg Gin Lys Arg Ser Ala Val Lys Ser Glu Gly Lys 
245 250 255 

Arg Lys Cys Asp Glu Val Asp Gly He Asp Glu Val Ala Ser Thr Met 
260 265 270 

Ser Thr Val His Glu He Leu Cys Lys Leu Ser Leu Glu Gly Val His 
275 280 285 

Ser Thr Pro Pro Ser Thr Arg He 
290 / 295 

S 

<210> 13 
<211> 846 
<212> DMA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(846) 

<220> 

<223> Description of Artificial Secjuence: Caspase 
6 -VEip- substrate construct 

<400> 13 

atg get age aaa gga gaa gaa etc ttc act gga gtt gtc cca att ctt 4 8 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

1 S 10 15 

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 .25 30 

9^9 ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc ate 144 
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Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc act act ggc aaa ctg cct gtt cca tgg cca aca eta gtc act act 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg tgc tat ggt gtt caa tgc ttt tea aga tac ccg gat cat atg aaa 240 
Leu Cys Tyr Gly Val Gin Cys Piie Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

egg cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa 2 88 
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

agg acc ate ttc ttc aaa gat gac ggc aac tac aag aca cgt get gaa 33 6 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtc aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt 3 64 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432 
lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tat aac tea cac aat gta tac ate atg gca gac "aaa caa aag aat 4 80 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 i55 160 

gga ate aaa gtg aac ttc aag acc egc cac aac att gaa gat gga age 528 
Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtt caa eta gca gac eat tat caa caa aat act cca att ggc gat ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

cct gtc ctt tta cca gac aac cat tac ctg tec aca caa tct gcc ctt 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

teg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gta aca get get ggg att aca cat ggc atg gat gaa ctg tac aac tec 720 
Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 

gga aga agg aaa ega caa aag cga teg aca aga ctt gtt gaa att gac 7 68 
Gly Arg Arg Lys Arg Gin Lys Arg Ser Thr Arg Leu Val Glu He Asp 
245' 250 255 

aac agt act atg age aca gta cac gaa att tta tgt aaa tta age tta 816 
Asn Ser Thr Met Ser Thr Val His Glu He Leu Cys Lys Leu Ser Leu 
260 265 270 

gaa gga gta cac agt aca cca cca age gca 846 
Glu Gly Val His Ser Thr Pro Pro Ser Ala 
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275 280 



<210> 14 

<211> 282 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase 
6-VEID-siibstrate construct 

<400> 14 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys ^ Phe lie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
B5 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 . 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tlxr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 

Gly Arg Arg Lys Arg Gin Lys Arg Ser Thr Arg Leu Val Glu He Asp 
245 250 255 
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Asn Ser Thr Met Ser Thr Val His 
260 

Glu Gly Val His Ser Thr Pro Pro 
275 280 



Glu lie Leu Cys Lys lieu Ser Leu 
265 270 

Ser Ala 



<210> 15 
<211> 876 
<2i2> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) , , (876) 

<220> 

<223> Description of Artificial Sequence: Caspase 8-VETD 
construct 

<400> 15 ' 
atg get age aaa gga gaa gaa etc ttc act gga gtt gtc cca att ctt 4 8 
Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 ■ 

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

tgc act act ggc aaa ctg ect gtt cca tgg cca aca eta gtc act act 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg tgc tat ggt gtt caa tgc ttt tea aga tac ccg gat cat atg aaa 240 
Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

egg cat gac ttt ttc aag agt gee atg ccc gaa ggt tat gta cag gaa 288 
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 9b 95 

agg acc ate tt:c ttc aaa gat gac ggc aac tac aag aca cgt get gaa 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 i05 110 

gtc aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432 
He Asp Phe Lys Glu Asp Gly Asn He lieu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tat aac tea cac aat gta tac ate atg gca gac aaa caa aag aat 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 
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gga ate aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga age 528 
Gly lie Lys Val Asn Phe Lys Thr Arg His Asn lie Glu Aap Qly Ser 
165 170 175 

gtt caa eta gca gac cat tat caa caa aat act cca att gge gat ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
IBO 185 190 

cct gtc ctt tta cca gac aac cat tac ctg tec aca caa tct gcc ctt 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

teg aaa gat cec aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gta aca get get ggg att aca cat ggc atg gat gaa. ctg tac aac tec 720 
Val Tlir Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 

gga aga age aaa cga caa aag cga teg tat gaa aaa gga ata cca gtt 768 
Gly Arg Ser Lys Arg Gin Lys Arg Ser Tyr Glu Lys Gly lie Pro Val 
245 250 255 

gaa aca gac age gaa gag caa get tat agt act atg tct act gtc cac 816 
Glu Thr Asp Ser Glu Glu Gin Ala Tyr Ser Thr Met Ser Thr Val His 
260 265 270 

gaa ate ctg tgc aag etc age ttg gag ggt gtt eat tct aca ccc cca 864 
Glu lie Leu Cys Lys Leu Ser Leu Glu Gly Val His Ser Thr Pro Pro 
275 280 285 

agt gcc gga tec 
Ser Ala Gly Ser 
290 



<210> 16 . 
<211> 292' 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Caspase 8-VETD 
construct 

<400> 16 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly. Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 • 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Cys Tyr Qly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 
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Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
65 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
"100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly lie Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
IBO 1B5 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Sqx Lys Asp Pro Asn Glu Lys, Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 

Gly Arg Ser Lys Arg Gin Lys Arg Ser Tyr Glu Lys Gly He Pro Val 
245 250 255 

Glu Thr Asp Ser Glu Glu Gin Ala Tyr Ser Thr Met Ser Thr Val His 
^ 260 265 270 

Glu He Leu Cys Lys Leu Ser Leu Glu Gly Val His Ser Thr Pro Pro 
275 280 285 

Ser Ala Gly Ser 
290 



<210> X7 
<2il> 906 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) • . (906) 

<220> 

<223> Description of Artificial Sequence: Cas 3 -multiple 
DEVD construct 

<400> 17 

atg get age aaa gga gaa gaa etc ttc act gga gtt gtc cca att ctt 48 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

1 5 . 10 15 
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gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc ate 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc act act ggc aaa ctg cct gtt cca tgg cca aca eta gtc act act 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr* 
50 55 60 



ctg tgc tat ggt gtt caa tgc ttt tea aga tac ccg gat cat atg aaa 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

65 70 75 BO 

egg cat gac ttt ttc aag agt .gee atg ccc gaa ggt tat gta cag gaa 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

as 90 95 



240 



286 



agg acc ate ttc ttc aaa gat gac ggc aac tac aag aca cgt get gaa 33 6 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtc aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
13.5 120 125 

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432 
lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tat aac tea cac aat gta tac ate atg gca gac aaa caa aag aat 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 / 150 155 160 

gga ate aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga age 
Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 



480 



528 



gtt caa eta gca gac cat tat caa caa aat act cca att ggc gat ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 



624 



cct gtc ctt tta cca gac aac cat tac ctg tec aca caa tet gee ctt 
Pro Val Leu Leu Pro Asp Asn ' His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

teg aaa gat ccc aac gaa aag aga gac eac atg gtc ctt ctt gag ttt 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gta aca get get ggg att aca eat ggc atg gat gaa ctg tac aac tec 
Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 

gga aga agg aaa cga caa aag cga teg gca ggt gac gaa gtt gat gca 768 
Gly Arg Arg Lys TVrg Gin Lys Arg Ser Ala Gly Asp Glu Val Asp Ala 
245 250 255 



672 



720 
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ggt gac gaa gtt gat gca ggt gac gaa gtt gat gca ggt gac gaa gtt 816 
Gly Asp Glu Val Asp Ala Gly Asp Glu Val Asp Ala Gly Asp Glu Val 
260 265 270 

gac gca ggt agt act atg tct act gtc cac gaa ate ctg tgc aag etc 864 
Asp Ala Gly Ser Thr Met Ser Thr Val His Glu lie Leu Cys Lys Leu 
275 280 285 

age ttg gag ggt gtt cat tct aca ccc cca agt gcc gga tec 906 
Ser Leu Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser 
290 295 300 



<210> IB 
<2li> 302 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cas 3 -multiple 
DEVD construct 

<400> 18 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

Vai Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ilie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp^His Met Lys 
65 70 75 80 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Qln Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 



31 



wo 00/50872 



PCT/USOO/04794 



Ser Lys Asp Pro Asn Glu Lye Arg Asp His Met Val Leu Leu Glu Plie 
210 215 220 

Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Asn Sear 
225 230 235 240 

Gly Arg Arg Lys Arg Gin Lys Arg Ser Ala Gly Asp Glu Val Asp Ala 
245 250 255 

Gly Asp Glu Val Asp Ala Gly Asp Glu Val Asp Ala Gly Asp Glu Val 
260 265 270 

Asp Ala Gly Ser Thr Met Ser Thr Val His Glu He Leu Cys Lys Leu 
275 280 285 

Ser Leu Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser 
290 295 300 



<210> 19 
<211> 906 
<212> DNA 

<213> Artificial Secjuence 

<220> 

<221> CDS 

<222> (1) . . (885) 



<220> 

<223> Description of Artificial Sequence: Caspase 
8 -multiple VETD construct 

<400> 19 

atg get age aaa gga gaa gaa etc ttc act gga gtt gtc cca att ctt 48 
Met-^Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 , 25 30 

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

tgc act act ggc aaa ctg cct gtt cca tgg cca aca eta gtc act act 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Ttir 
50 55 60 

ctg tgc tat ggt gtt caa tgc ttt tea aga tac ccg gat cat atg aaa 240 
Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65. '70 75 '80 

egg eat gac ttt ttc aag agt gee atg ccc gaa ggt tat gta cag gaa 288 
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

agg acc ate ttc ttc aaa gat gac ggc aac tac aag aca cgt get gaa 33 6- 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 
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gtc 
Val 


aag 
Lys 


ttt 
Phe 
115 


gaa 
Glu 


ggt 

Gly 


gat acc 
Asp Thr 


ctt 
Leu 
120 


gtt 

Val 


aat 
Asn 


aga 
Arg 


ate 
He 


gag tta aaa 
Glu Leu Lys 
125 


ggt 
Gly 


3B4 


att 
lie 


gac 
Asp 
13 0 


ttc 
Phe 


aag 
Lys 


gaa 
Glu 


gat ggc aac 
Asp Gly Asn 
135 


att 
Xle 


ctg 
Leu 


gga 
Gly 


cac 
His 
140 


aaa ttg 
Lys Leu 


gaa 
Glu 


tac 
Tyr 


432 


aac 
Abxi 
145 


tat 
Tyr 


aac 
Asn 


tea 
Ser 


cac 
His 


aat 
Asn 
ISO 


gta 
Val 


tac 
Tyr 


ate 
He 


atg 
Met 


gea 
Ala 
155 


gac 
Asp 


aaa caa aag 
Lys Gin Lys 


aat 
Asn 
160 


4B0 


gga 
Gly 


ate 
lie 


aaa 
Lys 


gtg 
Val 


aac 
Asn 
165 


ttc 
Phe 


aag 
Lys 


ace 
Thr 


cgc 
Arg 


cac 
His 
170 


aac 
Asn 


att 
He 


gaa gat 
Glu Asp 


gga age 
Gly Ser 
175 


528 


gtt 
Val 


caa 
Gin 


eta 
Leu 


gea 
Ala 
IBO 


gac 
Asp 


cat 
His 


tat 
Tyr 


caa 
Gin 


caa 
Gin 
185 


aat 
Asn 


act 
Thr 


eea 
Pro 


att ggc gat ggc 
He Gly Asp Gly 
190 


576 


cct 
Pro 


gtc 
Val 


ctt 
Leu 
195 


tta 
Leu 


eea 
Pro 


gac aac 
Asp Asn 


cat 
His 
200 


tac 
Tyr 


ctg 
Leu 


tec 
Ser 


aca 
Thr 


caa tct 
Gin Ser 
205 


gee 
Ala 


ctt 
Leu 


624 


teg 
Ser 


aaa 
Lys 

210 


gat 
Asp 


ccc 
Pro 


aac 
Asn 


gaa aag aga 
Glu Lys Arg 
215 


gac 
Asp 


cac 
His 


atg 
Met 


gtc 
Val 
220 


ctt Ctt 
Leu Leu 


gag 
Glu 


ttt 
Ph.e 


672 


gta 
Val 


aca 
Thr 


get 
Ala 


get 
Ala 


ggg 

Gly 


att 
lie 
230 


aca 
Thr 


cat 
His 


ggc 
Gly 


atg 
Met 


gat 
Asp 
235 


gaa 
Glu 


ctg tac 
Leu Tyr 


aac 
Asn 


tec 
Ser 
240 


720 


gga 

Gly 


aga agg aaa 
Ar^ Arg Lys 


cga 
Arg 
245 


caa 
Gin 


aag 
Lys 


cga 
Arg 


teg 
Ser 


gca 
Ala 
250 


ggt 
Gly 


gtt 
Val 


gaa aca gac gca 
Glu Thr Asp Ala 
255 


768 


ggt 
Gly 


gtt 

Val 


gaa 
Glu 


aca 
Thr 
260 


gac 
Asp 


gca ggt gtt 
Ala Gly Val 


gaa 
Glu 
2 65 


aca 
Thr 


gac 
Asp 


gca 
Ala 


ggt gtt 
Gly Val 
270 


gaa 
Glu 


aca 
Tlxr 


816 


gac 
Asp 


gca 
Ala 


ggt 
Gly 
275 


agt 
Ser 


act 
Thr 


atg 
Met 


tct 
Ser 


act 
Thr 
280 


gtc 
Val 


cac 
His 


gaa 

Glu 


ate 
He 


ctg tgc 
Leu Cys 
285 


aag 
Lys 


etc 
Leu 


864 



age ttg gag ggt gtt cat tct acacccccaa gtgccggatc c 906 
Ser Leu Glu Gly Val His Ser 
290 295 



<210> 20 
<211> 295 
<212> PRT 
<213> Artificial 



Sequence 



<220> 

<:223> Description of Artificial 
8 -multiple VETD construct 

<400> 20 

Met Ala Ser Lys Gly Glu Glu Leu 



Sequence : Caspase 

Phe Thr Gly Val Val Pro He Leu 
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X 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25. 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 - 155 160 

Gly lie Lys Val Asn Phe Lys Thr Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 

~ Gly Arg Arg Lys Arg Gin Lys Arg Ser Ala Gly Val Glu Thr Asp Ala 
245 250 255 

Gly Val Glu Thr Asp Ala Gly Val Glu Thr Asp Ala Gly Val Glu Thr 
260 265 270 

Asp Ala Gly Ser Thr Met Ser Thr Val His Glu He Leu Cys Lys Leu 
275 280 285 

Ser Leu Glu Gly Val His Ser 
290 295 



<210> 21 
<211> 4833 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<221> CDS 

<222> (1) . . (4830) 

<220> 

<223> Description of Artificial Sequence: 
EyFP-DEVD-MAP4-EBPP construct 

<400> 21 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 

Met Val Ser Lys Gly Glu Qlu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec g^c 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gee aec tac ggc aag ctg acc ctg aag ttc at:c 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 .45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Tlar 
50 55 60 

ttc ggc tac ggc ctg cag tgc ttc gee cgc tac ccc gac cac atg aag 240 
Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gee atg ccc gaa ggc tac gtc cag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyx 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gee gac aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gcc gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
ISO IBS 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age tac cag tec gcc ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 
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age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 

Ser Lys Asp Pro Asn Glu Lye Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag aag 720 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Lys 
225 230 235 240 

gga gac gaa gtg gac gga atg gcc gac etc agt ctt gtg gat gcg ttg 768 

Gly Asp Glu Val Asp Gly Met Ala Asp Leu Ser Leu Val Asp Ala Leu 
24S 250 255 

aca gaa cca cct cca gaa att .gag gga gaa ata aag cga gac ttc atg 816 

Thr Glu Pro Pro Pro Glu He Glu Gly Glu He Lys Arg Asp Phe Met 

260 265 270 

get gcg ctg gag gca gag ccc tat gat gac ate gtg gga gaa act gtg 864 
Ala Ala Leu Glu Ala Glu Pro Tyr Asp Asp He Val Gly Glu Thr Val 
275 280 285 

gag aaa act gag ttt att cct etc ctg gat ggt gat gag aaa acc ggg 912 

Glu Lye Thr Glu Phe He Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly 
290 295 300 



aac tea gag tec aaa aag aaa ccc tgc tta gac act age cag gtt gaa 

Asn Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gin Val Glu 

305 310 315 320 

ggt ate cca tct tet aaa cca aca etc eta gee aat ggt gat cat gga 

Gly He Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly 

325 330 335 



gtc tec agt gga ccc acg aac get tet gca ttt aca gag cga gac aat 
Val Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn 
405 410 415 



960 



1008 



atg gag ggg aat aac act gca ggg tct cca act gac ttc ctt gaa gag 1056 

Met Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu 

340 / 345 350 

aga gtg gac tat cc^ gat tat cag age age cag aac tgg cca gaa gat 1104 

Arg Val Asp Tyr Pro Asp Tyr Gin Ser Ser Gin Asn Trp Pro Glu Asp 
355 360 365 

gca age ttt tgt ttc cag cct cag caa gtg tta gat act gac cag get 1152 

Ala Ser Phe Cys Phe Gin Pro Gin Gin Val Leu Asp Thr Asp Gin Ala 
370 375 380 

gag ccc ttt aac gag cac cgt gat gat ggt ttg gca gat ctg etc ttt 1200 

Glu Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe 
385 390 395 400 



1248 



cct tea gaa gac agt tac ggt atg ctt ccc tgt gac tea ttt get tec 1296 
Pro Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp Ser Phe Ala Sex 
420 425 430 

acg get gtt gta tct cag gag tgg tct gtg gga gcc cca aac tct cca. 1344 
Thr Ala Val Val Ser Gin Glu Trp Ser Val Gly Ala Pro Asn Ser Pro 
435 440 445 
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tgt tea gag tec tgt gtc tec cca gag gtt act ata gaa acc eta cag 13 92 
Cys Ser Glu Ser Cys Val Ser Pro Glu Val Thr He Glu Thr Leu Gin 
450 455 460 

cca gca aca gag etc tec aag gea gca gaa gtg gaa tea gtg aaa gag 1440 
Pro Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu Ser Val Lys Glu 
465 470 475 4B0 

cag ctg cca get aaa gca ttg gaa acg atg gca gag cag acc act gat 148 8 
Gin Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu Gin Thr Thr Asp 
485 490 495 

gtg gtg eac tct cca tec aca gac aca aca cca ggc cca gac aca gag 153 6 
Val Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu 
500 505 510 

gca gca ctg get aaa gac ata gaa gag ate acc aag cca gat gtg ata 1584 
Ala Ala Leu Ala Lys Asp He Glu Glu He Thr Lys Pro Asp Val He 
515 520 525 

ttg gea aat gtc acg cag cca tct act gaa teg gat atg ttc ctg gcc 1632 
Leu Ala Asn Val Thr Gin Pro Ser Thr Glu Ser Asp Met Phe Leu Ala 
530 535 540 

cag gac atg gaa eta etc aca gga aca gag gca gcc cac get aac aat 1680 
Gin Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala His Ala Asn. Asn 
545 550 555 560 

ate ata ttg ect aca gaa cca gac gaa tct tea acc aag gat gta gca 172 8 
He He Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr Lys Asp Val Ala 
565 570 575 

cca ect atg gaa gaa gaa att gtc cca ggc aat gat acg aca tec ccc 1776 
Pro Pro Met Glu Glu Glu He Val Pro Gly Asn Asp Thr Thr Ser Pro 
580 585 590 

aaa gaa aca gag aca aca ctt cca ata aaa atg gac ttg gca cca ect 1824 
Lys Glu Thr Glu Thr Thr Leu Pro He Lye Met Asp Leu Ala Pro Pro 
595 600 605 

gag gat gtg tta ctt acc aaa gaa aca gaa eta gcc cca gcc aag ggc 1872 
Glu Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly 
610 615 620 

atg gtt tea etc tea gaa ata gaa gag get ctg gca aag aat gat gtt 1920 
Met Val Ser Leu Ser Glu He Glu Glu Ala Leu Ala Lys Asn Asp Val 
625 630 635 640 

cgc tct gca gaa ata ect gtg get cag gag aca gtg gtc tea gaa aca 1968 
Arg Ser Ala Glu He Pro Val Ala Gin Glu Thr Val Val Ser Glu Thr 
645 650 655 

gag gtg gtc ctg gca aca gaa gtg gta ctg ccc tea gat ccc ata aca 2016 
Glu Val Val Leu Ala Thr Glu Val Val Leu Pro Ser Asp Pro He Thr 
660 665 670 

aca ttg aca aag gat gtg aca etc ccc tta gaa gca gag aga ccg ttg 2 064 
Thr Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu 
675 680 685 

gtg acg gac atg act cca tct ctg gaa aca gaa atg acc eta ggc aaa 2112 
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Val Thr Asp Met Thx Pro Ser Leu Glu Thr Glu Met Thr Leu Gly Lys 
690 695 700 



gag aba get cca ccc aca gaa aca aat ttg ggc atg gcc aaa gac atg 
Glu Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met Ala Lys Asp Met 
705 710 715 720 



gaa ga^ gag gta acc tea gtc aag gac atg tct ccg tct gca gaa aca 
Glu Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro Ser Ala Glu Thr 
755 760 765 



att gtg gac aac age atg get eca gcc tec gat ctt gca ctg ccc ttg 
lie Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu 
785 790 ,795 BOO 

gaa aca aaa gta gca aca gtt cca att aaa gac aaa gga act gta cag 
Glu thr Lys Val Ala Thr Val Pro lie Lys Asp Lys Gly Thr Val Gin 
805 810 815 

act gaa gaa aaa cca cgt gaa gac tec cag tta gca tct atg cag cac 
Thr Glu Glu Lys Pro Arg Glu Aisp Ser Gin Leu Ala Ser Met Gin His 
820 825 830 



aaa get gca gaa caa atg tct acc tta cca ata gat gca cet tct cca 
Lys Ala Ala Glu Gin Met Ser Thr Leu Pro He Asp, Ala Pro Ser Pro 
850 855 860 

tta gag aac tta gag cag aag gaa acg cct ggc age cag cet tct gag 
Leu Glu Asn Leu Glu Qln Lys Glu Thr Pro Gly Ser Gin Pro Ser Glu 
865 870 875 880 

cct tgc tea gga gta tec egg caa gaa gaa gca aag get get gta ggt 
Pro Gys Ser Gly Val Ser Arg Gin Glu Glu Ala Lys Ala Ala Val Gly 
885 890 895 

gtg act gga aat gac ate act acc ccg cca aac aag gag cca cca cca 
Val Thr Gly Asn Asp He Thr Thr Pro Pro Asn Lys Glu Pro Pro Pro 
900 905 910 

age cca gaa aag aaa gca aag cct ttg gcc acc act caa cct gca aag 
Ser Pro Glu Ly's Lys Ala Lys Pro Lexi Ala Thr Thr Gin Pro Ala Lys 
915 920 925 



2160 



tct cca etc cca gaa tea gaa gtg act ctg ggc aag gac gtg gtt ata 2208 
Ser Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys Asp Val Val He 
725 730 735 

ctt cca gaa aca aag gtg get gag ttt aac aat gtg act cca ctt tea 2256 
Leu Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val Thr Pro Leu Ser 
740 745 750 



2304 



gag get ccc ctg get aag aat get gat ctg cac tea gga aca gag ctg 2352 
Glu Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Gly Thr Glu Leu 
770 775 780 



2400 



2448 



2496 



aag gga cag tea aca gta cct cct tgc acg get tea cca gaa cca gtc 2544 
Lys Gly Gin Ser Thr Val Pro Pro Cys Thr Ala Ser Pro Glu Pro Val 
835 840 . 845 



2592 



2640 



2688 



2736 



2784 



act tea aca teg aaa gcc aaa aca cag ccc act tct etc cct aag caa 2832 
Thr Ser Thr Ser Lys Ala Lys Thr Gin Pro Thr Ser Leu Pro Lys Gin 
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930 935 940 

cca get ccc acc acc tct ggk ggg ttg aat aaa aaa ccc atg age etc 2 880 
Pro Ala Pro Thr Thr Ser Qly Gly Leu Asn Lys Lys Pro Met Ser Leu 
945 950 955 960 

gcc tea ggc tea gtg cca get gee cca cac aaa cgc cct get get gcc 2 928 
Ala Ser Gly Ser Val Pro Ala Ala Pro His Lys Arg Pro Ala Ala Ala 
965 970 975 

act get act gcc agg cct tee acc eta cct gcc aga gac gtg aag cca 2976 
Thr Ala Thr Ala Arg Pro Ser Ttir Leu Pro Ala Arg Asp Val Lys Pro 
980 985 990 

aag cca att aca gaa get aag gtt gee gaa aag egg acc tet cca tec 3 024 
Lys Pro lie Thr Glu Ala Lys Val Ala Glu Lys Arg Thr Ser Pro Ser 
99S 1000 1005 

aag cct tea tct gcc cca gcc etc aaa cct gga cct aaa ace ace cca 3 072 
Lys Pro Ser Ser Ala Pro Ala Leu Lys Pro Gly Pro Lys Thr Thr_ Pro 
1010 1015 1020 

acc gtt tea aaa gcc aca tct ccc tea act ctt gtt tec act gga cca 3120 
Thr Val Ser Lys Ala Thr Ser Pro Ser Thr Leu Val Ser Thr Gly Pro 
1025 1030 1035 1040 

agt agt aga agt cca get aca act etg cct aag agg cca ace age ate 3168 
Ser Ser Arg Ser Pro Ala .Thr Thr Leu Pro Lys Arg Pro Thr Ser lie 
1045 1050 1055 

aag act gag ggg aaa cct get gat gtc aaa agg atg act get aag tct 3216 
Lys Thr Glu Gly Lys Pro Ala Asp Val Lys Arg Met Thr Ala Lys Ser 
1060 1065 1070 

gcc tea get gac ttg agt cgc tea aag acc acc tct gcc agt tct gtg 3264 
Ala Ser Ala Asp Leu Ser Arg Ser Iiys Thr Thr Ser Ala Ser Ser Val 
1075 1080 1085 

aag aga aac ace act ccc act ggg gea gea ccc cca gca 'ggg atg act 3312 
Lys Arg Asn Thr Thr Pro Thr Gly Ala Ala Pro Pro Ala Gly Met Thr 
1090 1095 1100 

tee act ega gtc aag ccc atg tct gca cct age cgc tct tct ggg get 33 60 
Ser Thr Arg Val Lys Pro Met Ser Ala Pro Ser Arg Ser Ser Gly Ala 
1105 1110 1115 1120 

ctt tct gtg gac aag aag ccc act tec act aag cct age tec tct get 3408 
Leu Ser Val Asp Lys Lys Pro Thr Ser Thr Lys Pro Ser Ser Ser Ala 
1125 1130 1135 

ccc agg gtg age cgc etg gcc aca act gtt tct gcc cct gac etg aag 3456 
Pro Arg Val Ser Arg Leu Ala Thr Thr Val Ser Ala Pro Asp Leu Lys 
1140 1145 1150 

agt gtt cgc tec aag gtc ggc tct aca gaa aac ate aaa cac cag cct 3504 
Ser Val Arg Ser Lys Val Gly Ser Thr Glu Asn lie Lys His Gin Pro 
1155 1160 1165 

99^ 99° aaa gta gag aaa aaa aca gag gca get acc aca 3 552 

Gly Gly Gly Arg Ala Lys Val Glu Lys Lys Thr Glu Ala Ala Thr Thr 
1170 1175 1180 
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get ggg aag cct gaa cct aat gca gtc act aaa gca gcc ggc tec att 3600 
Ala Gly Lys Pro Glu Pro Asn Ala Val Thr Lys Ala Ala Gly Ser lie 
11B5 1190 1195 1200 



geg agt gca cag aaa ccg cct get ggg aaa gtc cag ata gta tec aaa 
Ala Ser Ala Gin Lys Pro Pro Ala Gly Lys Val Gin lie Val Ser Lys 
1205 1210 1215 



gtg gac ata tec aag gtc tec tec aag tgt ggg tec aaa get aat ate 
Val Asp lie Ser Lys Val Ser Ser Lys Cys Gly Ser Lys Ala Asn lie 
1250 1255 1260 

aag cac aag cct ggt gga gga gat gtc aag att gaa agt cag aag ttg 
Lys His Lys Pro Gly Gly Gly Asp Val Lys lie Glu Ser Gin Lys Leu 
1265 1270 1275 1280 

aac ttc aag gag aag gcc caa gcc aaa gtg gga tec ctt gat aac gfct 
Asn Phe Lys Glu Lys Ala Gin Ala Lys Val Gly Ser Leu Asp Asn Val 
1285 - 1290 1295 



3648 



aaa gtg age tac agt cat att caa tec aag tgt gtt tec aag gac aat 3696 
Lys Val Ser Tyr Ser His He Gin Ser Lys Cys Val Ser Lys Asp Asn 
1220 1225 1230 

att aag cat gtc cct gga tgt ggc aat gtt cag att cag aac aag aaa 3744 
He Lys His Val Pro Gly Cys Gly Asn Val Gin He Gin Asn Lys Lys 
1235 1240 1245 



3792 



3840 



3888 



ggc cac ttt cct gca gga ggt gcc gtg aag act gag ggc ggt ggc agt 3 936 
Gly His Phe Pro Ala Gly Gly Ala Val Lys Thr Glu Gly Gly Gly Ser 
1300 1305 1310 

gag gcc ctt ccg tgt cca ggc ccc ecc get ggg gag gag cca gtc ate 3984 
Glu Ala Leu Pro Cys Pro Gly Pro Pro Ala Gly Glu Glu Pro Val He 
1315 1320 / 1325 

cct gag get geg cct gac cgt ggc gcc cct act tea gee agt ggc etc 4 032 
Pro Glu Ala Ala Pro Asp Arg Gly Ala Pro Thr Ser Ala Ser Gly Leu 
1330 1335 1340 

agt ggc cac acc ace ctg tea ggg ggt ggt gac caa agg gag ccc cag 4080 
Ser Gly His Thr Thr Leu Ser Gly Gly Gly Asp Gin Arg Glu Pro Gin 
1345 1350 1355 1360 

acc ttg gac age cag ate cag gag aca age ate atg gtg age aag ggc 4128 
Thr Leu Asp Ser Gin He Gin Glu Thr Ser He Met Val Ser Lys Gly 
1365 1370 1375 

gag gag ctg ttc ace ggg gtg gtg ccc ate ctg gtc gag ctg gac ggc 4176 
Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu Asp Gly 
1380 1385 1390 

gac gta aac ggc cac aag ttc age gtg tee ggc gag ggc gag ggc gat 4224 
. Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp 
1395 1400 1405 

gcc ace tac ggc aag ctg acc ctg aag ttc ate tgc acc acc ggc aag 4272 
Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys 
1410 1415 1420 
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ctg ccc gtg ccc tgg ccc acc etc gtg acc acc ctg acc cac ggc gtg 4320 
Leu Pro Val Pro Trp Pro Tlir Leu Val Thr Thr Leu Thr His Gly Val 
1425 1430 1435 1440 

cag tgc ttc age cgc tac ccc gac cac atg aag cag cac gac ttc ttc 4368 
Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe 
1445 1450 1455 

aag tec gcc atg ccc gaa ggc tac gtc cag gag cgc acc ate ttc ttc 4416 
Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe 
1460 1465 1470 

aag gac gac ggc aac tac aag acc cgc gcc gag gtg aag ttc gag ggc 4464 
Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly 
1475 1480 1485 

gac acc ctg gtg aac cgc ate gag ctg aag ggc ate gac ttc aag gag 4512 
Asp Thr Leu Val Asn Airg He Glu Leu Lys Gly He Asp Phe Lys Glu 
1490 1495 . 1500 

gac ggc aac ate ctg ggg cac aag ctg gag tac aac ttc aac age cac 4560 
Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Phe Asn Ser His 
1505 1510 1515 1520 

aac gtc tat ate atg gcc gac aag cag aag aac ggc ate aag gtg aac 4608 
Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn 
1525 1530 1535 

ttc aag ate cgc cac aac ate gag gac ggc age gtg cag etc gcc gac 4656 
Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp 
1540 1545 1550 

cac tac cag cag aac ace ccc ate ggc gac ggc ccc gtg ctg ctg ccc 4704 
His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro 
1555 1560 1565 

gac aac cac tac ctg age acc cag tec gcc ctg age aaa gac ccc aac 4752 
Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn 
1570 1575 1580 

gag aag cgc gat cac atg gtc ctg ctg gag ttc gtg ace gcc gcc ggg 4800 
Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly 
1585 1590 1595 1600 

ate act etc ggc atg gac gag ctg tac aag tag 4833 
ile Thr Leu Gly Met A^p Glu Leu Tyr Lys 
1605 1610 



<210> 22 
<211> 1610 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: 
EYFP-DEVD-MAP4-EBFP construct 

<400> 22 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 
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Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 - 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glii Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin. Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Lys 
225 230 235 240 

Gly Asp Glu Val Ajsp Gly Met Ala Asp Leu Ser Leu Val Asp Ala Leu 
245 250 255 

Thr Glu Pro Pro Pro Glu He Glu Gly Glu He Lys Arg Asp Phe Met 
260 265 270 

Ala Ala Leu Glu Ala Glu Pro Tyr Asp Asp He Val Gly Glu Thr Val 
275 280 285 

Glu Lys Thr Glu Phe He Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly 
290 295 300 

Asn Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gin Val Glu 
305 310 315 320 

Gly He Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly 
325 330 335 
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Met Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu 
340 345 350 

Arg Val Asp Tyr Pro Asp Tyr Gin Ser Ser Gin Asn Trp Pro Glu Asp 
355 360 365 

Ala Ser Phe Cys Phe Gin Pro Gin Gin Val Leu Asp Thr Asp Gin Ala 
370 375 380 

Glu Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe 
385 390 395 400 

Val Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn 
405 410 415 

Pro Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp Ser Phe Ala Ser 
420 425 430 

Thr Ala Val Val Ser Gin Glu Trp Ser Val Gly Ala Pro Asn Ser Pro 
435 440 445 

Cys Ser Glu Ser Cys Val Ser Pro Glu Val Thr lie Glu Thr Leu Gin 
450 455 460 

Pro Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu Ser Val Lys Glu 
465 470 475 480 

Gin Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu Gin Thr Thr Asp 
485 490 495 

Val Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu 
500 505 510 

Ala Ala Leu Ala Lys Asp He Glu Glu He Thr Lys Pro Asp Val He 
515 520 525 

Leu Ala Asn Val Thr Gin Pro Ser Thr Glu Ser Asp Met Phe Leu Ala 
530 535 540 

Gin Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala His Ala Asn Asn 
545 550 555 560 

He He Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr Lys Asp Val Ala 
565 570 575 

Pro Pro Met Glu Glu Glu He Val Pro Gly Asn Asp Thir Thr Ser Pro 
580 585 590 

Lys Glu Thr Glu Thr Thr Leu Pro He Lys Met Asp Leu Ala Pro Pro 
595 600 605 

Glu Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly 
610 615 620 

Met Val Ser Leu Ser Glu He Glu Glu Ala Leu Ala Lys Asn Asp Val 
625 630 635 640 

Arg Ser Ala Glu He Pro Val Ala Gin Glu Thr Val Val Ser Glu Thr 
645 650 655 

Glu Val Val Leu Ala Thr Glu Val Val Leu Pro Ser Asp Pro He Thr 
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660 665 670 

Thr Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu 
675 680 685 

Val Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met Thr Leu Gly Lys 
690 695 700 

Glu Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met Ala Lys Asp Met 
705 710 715 720 

Ser Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys Asp Val Val lie 
725 730 735 

Leu Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val Thr Pro Leu Ser 
740 745 750 

Glu Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro Ser Ala Glu Thr 
755 760 765 

Glu Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Gly Thr Glu Leu 
770 775 780 

He Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu 
785 790 795 BOO 

Glu Thr Lys Val Ala Thr Val Pro He Lys Asp Lys Gly Thr Val Gin 
805 810 815 

Thr Glu Glu Lys Pro Arg Glu Asp Ser Gin Leu Ala Ser Met Gin His 
820 825 830 

Lys Gly Gin Ser Thr Val Pro Pro Cys Thr Ala Ser Pro Glu Pro Val 
835 840 845 

Lys Ala Ala Glu Gin Met Ser Thr Leu P^o He Asp Ala Pro Ser fro 
850 855 860 

Leu Glu Asn Leu Glu Gin Lys Glu Thr Pro Gly Ser Gin Pro Ser Glu 
865 870 875 880 

Pro Cys Ser Gly Val Ser Arg Gin Glu Glu Ala Lys Ala Ala Val Gly 
885 890 895 

Val Thr Gly Asn Asp He Thr Thr Pro Pro Asn Lys Glu Pro Pro Pro 
900 905 910 

Ser Pro Glu Lys Lys Ala Lys Pro Leu Ala Thr Thr Gin Pro Ala Lys 
915 920 925 

Thr Ser Thr Ser Lys Ala Lys Thr Gin Pro Thr Ser Leu Pro Lys Gin 
930 935 940 

Pro Ala Pro Thr Thr Ser Gly Gly Leu Asn Lys Lys Pro Met Ser Leu 
945 950 955 960 

Ala Ser Gly Ser Val Pro Ala Ala Pro His Lys Arg Pro Ala Ala Ala 
965 970 975 

Thr Ala Thr Ala Arg Pro Ser Thr Leu Pro Ala Arg Asp Val Lys Pro 
980 985 990 
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Lye Pro lie Thr Glu Ala Lys Val Ala Glu Lys Arg Thr Ser Pro Ser 
995 1000 1005 

Lys Pro Ser Ser Ala Pro Ala Leu Lys Pro Gly Pro Lys Thr Thr Pro 
1010 ' 1015 1020 

Thr Val Ser Lys Ala Thr Ser Pro Ser Thr Leu Val Ser Thr Gly Pro 
1025 1030 1035 1040 

Ser Ser Arg Ser Pro Ala Thr Thr Leu Pro Lys Arg Pro Thr Ser lie 
1045 1050 1055 

Lys Thr Glu Gly Lys Pro Ala Asp Val Lys Arg Met Thr Ala Lys Ser 
1060 1065 1070 

Ala Ser Ala Asp Leu Ser Arg Ser Lys Thr Thr Ser Ala Ser Ser Val 
1075 1080 1085 

Lys Arg Asn Thr Thr Pro Thr Gly Ala Ala Pro Pro Ala Gly Met Thr 
1090 1095 1100 

Ser Thr Arg Val Lys Pro Met Ser Ala Pro Ser Arg Ser Ser Gly Ala 
1105 1110 1115 1120 

Leu Ser Val Asp Lys Lys Pro Thr Ser Thr Lys Pro Ser Ser Ser Ala 
1125 1130 1135 

Pro Arg Val Ser Arg Leu Ala Thr Thr Val Ser Ala Pro Asp Leu Lys 
1140 1145 1150 

Ser Val Arg Ser Lys Val Gly Ser Thr Glu Asn lie Lys His Gin Pro 
1155 1160 1165 

Gly Gly Gly Arg Ala Lys Val Glu Lys Lys Thr Glu Ala Ala Thr Thr 
1170 1175 1180 ^ 

Ala Gly Lys Pro Glu Pro Asn Ala Val Thr Lys Ala Ala Gly Ser lie 
1185 1190 1195 1200 

Ala Ser Ala Gin Lys Pro Pro Ala Gly Lys Val Gin He Val Ser Lys 
1205 1210 1215 

Lys Val Ser Tyr Ser His lie Gin Ser Lys Cys Val Ser Lys Asp Asn 
1220 1225 1230 

lie Lys His Val Pro Gly Cys Gly Asn Val Gin He Gin Asn Lys Lys 
1235 1240 1245 

Val Asp He Ser Lys Val Ser Ser Lys Cys Gly Ser Lys Ala Asn He 
1250 1255 1260 

Lys His Lys Pro Gly Gly Gly Asp Val Lys He Glu Ser Gin Lys Leu 
1265 1270 1275 1280 

Asn Phe Lys Glu Lys Ala Gin Ala Lys Val Gly Ser Leu Asp Asn Val 
1285 1290 1295 

Gly His Phe Pro Ala Gly Gly Ala Val Lys Thr Glu Gly Gly Gly Ser 
1300 1305 1310 
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Glu Ala Leu Pro Cys Pro Gly Pro Pro Ala Gly Glu Glu Pro Val lie 
1315 1320 1325 

Pro Glu Ala Ala Pro Asp Arg Gly Ala Pro Thr Ser Ala Ser Gly Leu 
1330 1335 1340 

Ser Gly His Thr Thr Leu Ser Gly Gly Gly Asp Gin Arg Glu Pro Gin 
1345 1350 1355 1360 

Thr Leu Asp Ser Gin lie Gin Glu Thr Ser lie Met Val Ser Lys Gly 
1365 1370 1375 

Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu Leu Asp Gly 
1380 13B5 1390 

Asp Val Ash Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp 
1395 1400 1405 

Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys 
1410 1415 1420 

Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr His Gly Val 
1425 1430 1435 1440 

Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe 
1445 1450 1455 

Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe 
1460 1465 1470 

Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly 
1475 14B0 14B5 

Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly He Asp Phe Lys Glu 
1490 1495 1500 

Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Phe Asn Ser His 
1505 1510 ,1515 1520 

Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn 
1525 1530 1535 

Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp 
1540 1545 1550 

His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro 
1555 1560 1565 

Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn 
1570 1575 1580 

Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly 
1585 1590 1595 1600 

He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
1605 1610 



<210> 23 
<211> 978 
<212> DNA 
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<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (978) 

<220> 

<223> Description of Artificial Sequence : 

GFP-nucleoluB-Caspase 8-annexin II construct 

<400> 23 

atg get age aaa gga gaa gaa etc ttc act gga gtt gtc cca att ctt 48 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

1 5 , 10 15 

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggt gaa ggt gat gca aca tac gga aaa ctt ace etg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

tgc act act ggc aaa etg cct gtt cea tgg cca aca eta gtc act act 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

etg tgc tat ggt gtt caa tgc ttt tea aga tac eeg gat cat *atg aaa 240 
Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys. 
€5 70 75 80 



egg cat gac ttt ttc aag agt gcc atgf ccc gaa ggt tat gta eag gaa 
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



288 



agg acc ate ttc ttc aaa gat gac ggc aac tac aag aca cgt get gaa 336 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 X05 110 

gtc aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
lis 120 125 

att gac ttc aag gaa gat ggc aac att etg gga cac aaa ttg gaa tac 432 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
13 0 135 140 

aac tat aac tea cac aat gta tac ate atg gca gac aaa caa aag aat 480 

Asn Tyr Asii Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

gga ate aaa gtg aac ttc aag acc egc cac aac att gaa gat gga age 528 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtt caa eta gca gac cat tat caa caa aat act cca att ggc gat ggc 576 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

cet gtc ctt tta cca gac aac cat tac etg tec aca caa tct gcc ctt 624 

pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
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195 200 205 

teg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gta aca get get ggg att aca cat ggc atg gat gaa ctg tac aac tec 72 0 
Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 

gga aga aaa cgt ata cgt act tac etc aag tee tgc agg egg atg aaa 768 
Gly Arg Lys Arg lie Arg Tiir Tyr Leu Lys Ser Cys Arg Arg Met Lys 
245 250 255 

aga agt ggt ttt gag atg tct cga cct att cct tec cac ctt act cga 816 
Arg Ser Gly Phe Glu Met Ser Arg Pro He Pro Ser His Leu Thr Arg 
260 265 270 

teg gca ggt gtt gaa aca gac gca ggt gtt gaa aca gac gca ggt gtt 864 
Ser Ala Gly Val Glu Thr Asp Ala Gly Val Glu Tlir Asp Ala Gly Val 
275 280 285 

gaa aca gac gca ggt gtt gaa aca gac gca ggt agt act atg tct act 912 
Glu Tlir Asp Ala Gly Val Glu Thr Asp Ala Gly Ser Thr Met Ser Thr 
290 295 300 

gtc cac gaa ate ctg tgc aag etc age ttg. gag ggt gtt cat tct aca 960 
Val His Glu He Leu Cys Lys Leu Ser Leu Glu Gly Val His Ser Thr 
305 310 315 320' 

ccc cca agt gee gga tec 578 
Pro Pro Ser Ala Gly Ser 
325 



<210> 24 
<:211> 326 
<212> PRT 

<213> Artificial Sequence ^ 
<220> 

' <223> Description of Artificial Secfuence: 

GFP-nucleoluS"Caspase B-annexin II construct 

<400> 24 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
- 20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys, Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 



48 



wo 00/50872 PCT/USOO/04794 

85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lye Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Ash 
145 150 155 160 

Gly lie Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser- 
ies 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 - 230 235 240 

Gly Arg Lys Arg He Arg Thr Tyr Leu Lys Ser Cys Arg Arg Met Lys 
245 250 255 

Arg Ser Gly Phe Glu Met Ser Arg Pro He Pro Ser His Leu Thr Arg 
260 265 270 

Ser Ala Gly Val Glu Thr A^p Ala Gly Val Glu Thr Asp Ala Gly Val 
275 280 285 

Glu Thr Asp Ala Gly Val Glu Thr Asp Ala Gly Ser Thr Met Ser Thr 
290 295 300 

Val His Glu He Leu Cys Lys Leu Ser Leu Glu Gly Val His Ser Thr 
305 310 315 320 

Pro Pro Ser Ala Gly Ser 
325 

<210> 25 
<:211> 948 
<212> DNA 
. <213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(948) 

<220> 

<223> Description of Artificial Sequence: 

GFP-nucleolus-Caspase 3-annexin II construct 
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<400> 25 

atg get age aaa gga gaa gaa cte ttc act gga gtt gtc cca att ctt 4 8 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 .5 10 15 

gtt gaa tta gat ggt gat gtt aae ggc cac aag ttc tct gtc agt gga 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggt gaa ggt gat gca aca tac gga aaa ctt ace ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Qly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc act act ggc aaa ctg cct gtt cca tgg cca aca eta gtc act aet 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg tgc tat ggt gtt caa tgc ttt tea aga tac ccg gat cat atg aaa 240 
Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 8 0 

egg cat gac ttt ttc aag agt gee atg ccc gaa ggt tat gta cag gaa 2 88 
Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

agg ace ate ttc ttc aaa gat gac ggc aac tac aag aca cgt get gaa 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
lOQ 105 110 

gtc aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 432 
lie Asp Phe Lys Glu- Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
13 0 135 / 140 

aac tat aac tea cac aat gta tac ate atg gca gac aaa caa aag aat 480 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 . 150 155 160 

gga ate aaa gtg aac ttc aag ace cgc cac aac att gaa gat gga age 528 
Gly lie Lys Val Asn Phe Lys Thr Arg His. Asn lie Glu Asp Gly Ser 
165 170 175 

gtt caa eta gca gac cat tat caa caa aat act cca att ggc gat ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

cct gtc ctt tta cca gac aac eat tac ctg tec aca caa tct gcc ctt 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

teg aaa gat cec aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 . 215 220 

gta aca get get ggg att aca eat ggc atg gat gaa ctg tac aac tee 720 
Val Thr Ala Ala Gly lie Thr His Qly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 
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gga aga aaa cgt ata cgt act tac etc aag tec tgc agg egg atg aaa 768 
Gly Arg Lys Arg lie Arg Thr Tyr Leu Lys Ser Cys Arg Arg Met Lys 
245 250 255 

aga agt ggt ttt gag atg tct cga cct att cct tec cac ett act cga 816 
Arg Ser Gly Phe Glu Met Ser Arg Pro lie Pro Ser His Leu Thr Arg 
260 265 270 

teg tat gaa aaa gga ata eca gtt gaa aca gac age gaa gag caa get 864 
Ser Tyr Glu Lye Gly lie Pro Val Glu Thr Asp Ser Glu Glu Gin Ala 
275 280 285 

tat agt act atg tct act gte cac gaa ate ctg tgc aag etc age ttg 912 
Tyr Ser Thr Met Ser Thr Val His Glu lie Leu Cys Lys Leu Ser Leu 
290 295 300 

gag ggt gtt cat tct aea eec eca agt gee gga tec 948 
Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser 
305 310 315 



<210> 26 
<211> 316 
<212> PRT 

<213> Artificial Sequence 
<220> 

<^23> Description .of Artificial Sequence: 

GPP -nucleolus -Caspase 3-annexin II construct 

<400> 26 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His. Lys Phe Ser Val Ser Gly 
■ 20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 .45 

Cys Thr Thr Gly .Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 
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Gly lie Lys Val Asn Phe Lys Tlir Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
IBO 1B5 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
225 230 235 240 

Gly Arg Lys Arg He Arg Thr Tyr Leu Lya Ser Cys Arg Arg Met Lys 
245 250 255 

Arg Ser Gly Phe Glu Met Ser Arg Pro He Pro Ser His Leu Thr Arg 
260 265 270 

Ser Tyr Glu Lys Gly He Pro Val Glu Thr Asp Ser Glu Glu Gin Ala 
275 280 285 

Tyr Ser Thr Met Ser Thr Val His Glu He Leu Cys Lys Leu Ser Leu 
290 295 300 

Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser 
305 310 315 



<210> 27 

<211> 2088 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (1041) 

<220> 

<223> Description of Artificial Sequence: 
NLS-Fred25-synaptobrevin construct 

<400> 27 

atg aga aga aaa cga caa aag get age aaa gga gaa gaa etc ttc act 4 8 

Met Arg Arg Lys Arg Gin Lys Ala Ser Lys Gly Glu Glu Leu Phe Thr 
X 5 10 15 

gga gtt gtc cca att ctt gtt gaa tta gat ggt gat gtt aac ggc cac 96 
Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 
20 25 30 

aag ttc tct gtc agt gga gag ggt gaa ggt gat gca aca tac gga aaa 144 
Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 
35 40 45 

ctt acc ctg aag ttc ate fcgc act act ggc aaa ctg cct gtt cca tgg 192 
Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 

50 55 . 60 . 
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cca aca eta gtc act act ctg tgc tat ggt gtt caa tgc ttt tea aga 240 
Pro Thr Leu Val Thr Thr Beu Cys Tyx Gly Val Gin Cys Phe Ser Arg 
65 70 75 80 

tac ccg gat cat atg aaa egg cat gac ttt ttc aag agt gcc atg ccc 288 
Tyr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Met Pro 
85 90 95 

gaa ggt tat gta cag gaa agg ace ate ttc ttc aaa gat gac ggc aac 33 6 
Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly As n 
100 105 110 

tac aag aca cgt get gaa gtc aag ttt gaa ggt gat acc ctt gtt aat 384 
Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 
115 120 125 



aga ate gag tta aaa ggt att gac ttc aag gaa gat ggc aac att ctg 432 
Arg lie Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu 
130 135 140 



gga cac aaa ttg gaa tac aac tat aac tea cac aat gta tac ate atg 480 
Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met 
145 150 155 160 

gea gac aaa caa aag aat gga ate aaa gtg aac ttc aag acc egc cac 52 8 
Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys Thr Arg His 
1S5 170 175 

aac att gaa gat gga age gtt caa eta gca gac cat tat caa caa aat 576 
Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 
180 185 190 

act cca att ggc gat ggc cct gtc ctt tta cca gac aac cat tac ctg 624 
Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 
195 200 205 

tec aca caa tct gcc ctt teg aaa gat ccc aac gaa aag aga gac cac 672 
Ser Thr Gin Ser Ala Leu Ser Lys Asp Prp Asn Glu Lys Arg Asp His 
210 215 220 

atg gtc ctt ctt gag ttt gta aca get get ggg att aca cat ggc atg 72 0 
Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met 
225 230 235 240 

gat gaa ctg tac aac acc ggt atg tct aca ggt cca act get gcc act 768 
Asp Glu Leu Tyr Asn Thr Gly Met Ser Thr Gly Pro Thr Ala Ala Thr 
245 250 255 

ggc agt aat ega aga ctt cag cag aca caa aat caa gta gat gag gtg 816 
Gly Ser Asn Arg Arg Leu Gin Gin Thr Gin Asn Gin Val Asp Glu Val 
260 265 270 

gtg gac ata atg- cga gtt aac gtg gac aag gtt ctg gaa aga gac cag 864 
Val Asp He Met Arg Val Asn Val Asp Lys Val Leu Glu Arg Asp Gin 
275 280 285 

aag etc tct gag tta gac gac cgt gca gac gca ctg cag gea ggc get 912 
Lys Leu Ser Glu Leu Asp Asp Arg Ala Asp Ala Leu Gin Ala Gly Ala 
290 295 300 

tct caa ttt gaa acg age gca gcc aag ttg aag agg aaa tat tgg tgg 960 
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Ser Qln Phe Glu Thr Ser Ala Ala Lys Leu Lys Arg hys Tyx Trp Trp 
305 310 315 320 

aag aat tgc aag atg tgg gca ate ggg att act gtt ctg gtt ate ttc 10 08 
Lys Asn Cys Lys Met Trp Ala He Gly He Thr Val Leu Val He Phe 
325 330 335 

ate ate ate ate ate gtg tgg gtt gtc tct tea tgaatgagaa gaaaacgaca 1061 
He He He He He Val Trp Val Val Ser Ser 
340 345 

aaaggctagc aaaggagaag aactcttcac tggagttgtc ccaattcttg ttgaattaga 1121 

tggtgatgtt aacggccaca agttctctgt cagtggagag ggtgaaggtg atgcaacata 1181 

cggaaaactt accctgaagt tcatctgcac tactggcaaa ctgcctgttc catggecaac 1241 

actagtcact actctgtgct atggtgttca atgcttttca agatacccgg atcatatgaa 13 01 

acggcatgac tttttcaaga gtgccatgcc cgaaggttat gtacaggaaa ggaccatctt 13 61 

cttcaaagat gacggcaact acaagacacg tgetgaagtc aagtttgaag gtgataccct 1421 

tgttaataga atcgagttaa aaggtattga cttcaaggaa gatggeaaca ttctgggaca 1481 

caaattggaa tacaactata actcacacaa tgtatacatc atggcagaca aacaaaagaa 1541 

tggaatcaaa gtgaacttca agacccgcca caacattgaa gatggaagcg ttcaactage 1601 

agaccattat caacaaaata ctccaattgg cgatggccct gtccttttac cagacaacca 1661 

ttacctgtcc acacaatctg ccctttcgaa agatcecaac gaaaagagag accacatggt 1721 

ccttcttgag tttgtaacag ctgctgggat tacacatggc atggatgaac tgtacaacac 1781 

cggtatgtct acaggtccaa ctgctgccac tggcagtaat cgaagacttc agcagacaca 1841 

aaatcaagta gatgaggtgg tggacataat gcgagt^taac gtggacaagg ttctggaaag 1901 

agaecagaag ctctctgagt tagacgaccg tgcagacgca ctgcaggcag gcgcttctca 1961 

atttgaaacg agcgcagcca agttgaagag gaaatattgg tggaagaatt gcaagatgtg 2021 



ggcaatcggg attactgttc tggttatctt catcatcatc atcatcgtgt gggttgtctc 2081 
ttcatga 2088 

<210> 28 
<211> 347 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
. NLS-Fred25-synaptobrevin construct 

<400> 28 

Met Arg Arg Lys Arg Gin Lys Ala Ser Lys Gly Glu Glu Leu Phe Thr 
1 5 10 15 
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Gly Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 
20 25 30 

Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Tlir Tyr Gly Lys 
35 40 43 

Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 
50 55 60 

Pro Thr Leu Val Thr Thr Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg 
55 70 75 80 

Tyr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Met Pro 
85 90 95 

Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn 
100 105 110 

Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 
115 120 125 

Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu 
130 135 140 

Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met 
145 ISO 155 160 

Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys Thr Arg His 
165 170 175 

Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 
180 185 190 

Thr Pro He Gly Asp Gly Pro Val Leu Leii Pro Asp Asn His Tyr Leu 
195 ^ 200 205 

Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 
210 215 220 

Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met 
225 230 235 240 

Asp Glu Leu Tyr Asn Thr Gly Met Ser Thr Gly Pro Thr Ala Ala Tiir 
245 250 255 

Gly Ser Asn Arg Arg Leu Gin Gin Thr Gin Asn Gin Val Asp Glu Val 
260 265 270 

Val Asp He Met Arg Val Asn Val Asp Lys Val Leu Glu Arg Asp Gin 
275 280 285 

Lys Leu Ser Glu Leu Asp Asp Arg Ala Asp Ala Leu Gin Ala Gly Ala 
290 295 300 

Ser Gin Phe Glu Thr Ser Ala Ala Lys Leu Lys Arg Lys Tyr Trp Trp 
305 310 315 320 

Lys Asn Cys Lys Met Trp Ala He Gly He Thr Val Leu Val He Phe 
325 330 335 

He He He He He Val Trp Val Val Ser Ser 
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340 345 



<210> 29 
<211> 2106 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (1050) 

<220> 

<223> Description of Artificial Sequence: 
KLS-Fred25-cellubrevin construct 

<400> 29 

atg aga aga aaa cga caa aag get age aaa gga gaa gaa etc ttc act 48 

Met Arg Arg Lys Arg Gin Lys Ala Ser Lys Gly Glu Glu Leu Phe Tiir 
15 10 15 

gga gtt gtc cca att ctt gtt gaa tta gat ggt gat gtt aac ggc cac 96 
Gly Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly Hxs 
20 25 30 

aag ttc tct gtc agt gga gag ggt gaa ggt gat gca aca tac gga aaa 144 
Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 
35 40 - 45 

ctt acc ctg aag ttc ate tgc act act ggc aaa ctg cct gtt cca tgg 192 
Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 
50 55 60 

cca aca eta gtc act act ctg tgc tat ggt gtt caa tgc ttt tea aga 240 
Pro Thr Leu Val Thr Thr Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg 
65 /70 75 80 

tac ccg gat cat atg aaa egg cat gac ttt ttc aag agt gee atg ccc 288 
Tyr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Met Pro 
85 90 95 

gaa ggt tat gta eag gaa agg acc ate ttc ttc aaa gat gac ggc aac 336 
Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn 
100 105 110 

tac aag aca cgt get gaa gtc aag ttt gaa ggt gat acc ctt gtt aat 3 84 
Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 
115 120 125 

aga ate gag tta aaa ggt att gac ttc aag gaa gat ggc aac att ctg 432 
Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu 
130 135 140 

gga cac aaa ttg gaa tac aac tat aac tea cac aat gta tac ate atg 480 
Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met 
145 150 155 160 

gca gac aaa caa aag aat gga ate aaa gtg aac ttc aag acc cgc cac 528 
Ala Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys Thr Arg His 
165 170 175 
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aac att gaa gat gga age gtt caa eta gca gac cat tat caa caa aat 576 
Asn lie Qlu Asp Gly Ser Val Gin Leu Ala Asp Hie Tyx Gin Gin Asn 
180 185 190 

act cca att ggc gat ggc cct gtc ctt tta cca gac aac cat tac ctg €24 
Thr Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 
155 200 205 

tec aca caa tct gcc ctt teg aaa gat ccc aac gaa aag aga gac cac 672 
Ser Tlir Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 
210 215 220 

atg gtc ctt ctt gag ttt gta aca get get ggg att aca cat ggc atg 720 
Met Val Leu Leu Glu Phe Vail Thr Ala Ala Gly He Thr His Gly Met 
225 230 235 240 

gat gaa ctg tac aac acc ggt atg tct aca ggt gtg cct teg ggg tea 768 
Asp Glu Leu Tyr Asn Thr Gly Met Ser Thr Gly Val Pro Ser Gly Ser 
245 250 255 

agt get gcc act ggc agt aat cga aga etc cag cag aca caa aat caa 8i6 
Ser Ala Ala Thr Gly Ser Asn Arg Arg Leu Gin Gin Thr Gin Asn Gin 
260 265 270 

gta gat gag gtg gtt gac ate atg aga gtc aat gtg gat aag gtg tta 864 
Val Asp Glu Val Val Asp He Met Arg Val Asn Val Asp Lys Val Leu 
275 280 285 

gaa aga gac cag aag etc teg gag eta gat gac cgc gca gat gca ctg 912 
Glu Arg Asp Gin Lys Leu Ser Glu Leu Asp Asp Arg Ala Asp Ala Leu 
290 295 300 

cag gca ggt gcc teg cag ttt gaa aca agt get gcc aag ttg aag aga 960 
Gin Ala Gly Ala Seir Gin Phe Glu Thr Ser Ala Ala Lys Leu Lys Arg 
305 ^ 310 315 320 

aag tat tgg tgg aag aac tgc aag atg tgg gcg ata ggg ate agt gtc 1008 
Lys Tyr Trp Trjp Lys Asn Cye Lys Met Trp Ala He Gly He Ser Val 
325 330 335 

ctg gtg ate att gtc ate ate ate ate gtg tgg tgt gtc tct 1050 
Leu Val He He Val He He He He Val Trp Cys Val Ser 
340 345 350 



taaatgagaa 


gaaaacgaca 


aaaggctage 


aaaggagaag 


aactcttcac 


tggagttgte 


1110 


ceaattcttg 


ttgaattaga 


tggtgatgtt 


aacggecaca 


agttctctgt 


cagtggagag 


1170 


ggtgaaggtg 


atgcaacata 


cggaaaactt 


accctgaagt 


tcatetgcac 


taetggcaaa 


1230 


ctgectgttc 


catggceaac 


actagtcact 


■aetetgtgct 


atggtgttea 


atgcttttca 


1290 


agataccegg 


atcatatgaa 


acggcatgac 


tttttcaaga 


gtgccatgcc 


cgaaggttat 


1350 


gtacaggaaa 


ggaccatctt 


ctteaaagat 


gaeggcaact acaagacacg 


tgctgaagtc 


1410 


aagtttgaag 


gtgatacect 


tgttaataga 


atcgagttaa 


aaggtattga 


ettcaaggaa 


1470 


gatggcaaea 


ttctgggaca 


caaattggaa 


tacaactata 


aeteacacaa 


tgtatacatc 


1530 


atggcagaca 


aaeaaaagaa 


tggaatcaaa 


gtgaacttca agacccgeca 


eaacattgaa 


1590 
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gatggaagcg 


ttcaactagc 


agaccattat 


caacaaaata 


ctccaattgg 


cgatggccct 


1650 


gtccttttac 


cagacaacca 


ttacctgtcc 


acacaatctg 


ccctttcgaa 


agatcccaac 


1710 


gaaaagagag 


accacatggt 


ccttcttgag 


tttgtaacag 


ctgctgggat 


tacacatggc 


1770 


atggatgaac 


tgtacaacac 


cggtatgtct 


acaggtgtgc 


cttcggggtc 


aagtgctgcc 


1830 


actggcagta 


atcgaagact 


ccagcagaca 


caaaatcaag 


tagatgaggt 


ggttgacatc 


1890 


atgagagtca 


atgtggataa 


ggtgttagaa 


agagaccaga 


agctctcgga 


gctagatgac 


1950 


cgcgcagatg 


cactgcaggc 


aggtgcctcg 


cagtttgaaa 


caagtgctgc 


caagttgaag 


2010 


agaaagtatt 


ggtggaagaa 


ctgcaagatg 


tgggcgatag 


ggatcagtgt 


cctggtgatc 


2070 


attgtcatca 


tcatcatcgt 


gtggtgtgtc 


tcttaa 






2106 



<210> 30 
<211> 350 
<212> PRT 

<213> Artificial Seqxience 
<220> 

<223> Description of Artificial Sequence: 
NIiS-Fred25-cellubrevin construct 

<400> 30 

Met Arg Arg Lys Arg Gin Lys Ala Ser Lys Gly Glu Glu Leu Phe Thr 
15 10 15 

Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 
20 25 30 

Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 
35 40 . 45 

Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 
50 55 60 

Pro Thr Leu Val Thr Thr Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg 
65 70 75 80 

Tyr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Met Pro 
85 90 95 

Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn 
100 105 110 

Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 
115 120 125 

Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu 
130 135 140 

Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met 
145 150 155 ISO 

Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys Thr Arg His 
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165 170 175 

Asn lie Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyx Gin Gin Asn 
IBO 185 190 

Thr Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 
195 200 205 

Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 
210 215 220 

Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met 
225 230 235 240 

Asp Glu Leu Tyr Asn Thr Gly Met Ser Thr Gly Val Pro Ser Gly Ser 
245 250 255 

Ser Ala Ala Thr Gly Ser Asn Arg Arg Leu Gin Gin Thr Gin Asn Gin 
260 265 270 

Val Asp Glu Val Val Asp He Met Arg Val Asn Val Asp Lys Val Leu 
275 280 285 

Glu Arg Asp Gin Lys Leu Ser Glu Leu Asp Asp Arg Ala Asp Ala Leu 
290 295 300 

Gin Ala Gly Ala Ser Gin Phe Glu Thr Ser Ala Ala Lys Leu Lys Arg 
305 310 315 - 320 

Lys Tyr Trp Tjcp Lys Asn Cys Lys Met Trp Ala He Gly He Ser Val 
325 330 335 

Leu Val He He Val He He He He Val Trp Cys Val Ser 
340 345 350 



<210> 31 
<211> 3171 
<212> DMA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (3168) 

<220> 

<223> Description of Artificial Sequence: 
NLS-EYFP-MAPKDM-EBFP construct 

<400> 31 

atg agg ccc aga aga aag gtg age aag ggc gag gag ctg ttc acc ggg 4 8 
Met Arg Pro Arg Arg Lys Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
15 10 15 

gtg gtg ccc ate ctg gtc gag ctg gac ggc gac gta aac ggc cac aag 96 
val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 
20 25 30 

ttc age gtg tec ggc gag ggc gag ggc gat gcc acc tac ggc aag ctg 144 
Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
35 40 45 
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acc ctg aag 
Thr Leu Lys 
50 

acc etc gfcg 
Thr Leu Val 
€5 

ccc gac cac 
Pro Asp His 



ggc tac gtc 
Gly Tyr Val 



aag acc cgc 
Lys Thr Arg 
1X5 

ate gag ctg 
lie Glu Leu 
130 

cac aag ctg 
* His Lys Leu 
145 

gac aag cag 
Asp Lys Gin 



ate gag gac 
lie Glu Asp 



ccc ate ggc 
Pro He Gly 
195 

tac cag tec 
Tyr Gin Ser 
210 

gtc ctg ctg 
Val Leu Leu 
225 

gag ctg tac 
Glu Leu Tyr 



gtg gat gcg 
Val Asp Ala 



cga gac ttc 
Arg Asp Phe 
275 



ttc ate tgc acc 
Phe He Cys Thr 

, 55 

acc acc ttc ggc 
Thr Thr Phe Gly 
70 

atg aag cag cac 
Met Lys Gin His 
85 

cag gag cgc acc 
Gin Glu Arg Thr 
100 

gcc gag gtg aag 
Ala Glu Val Lys 



aag ggc ate gac 
Lys Gly He Asp 
135 

gag tac aac tac 
Glu Tyr Asn Tyr 
150 

aag aac ggc ate 
Lys Asn Gly He 
165 

ggc age gtg cag 
Gly Ser Val Gin 
180 

gac ggc ccc gtg 
Asp Gly Pro Val 



gcc ctg age aaa 
Ala Leu Ser Lys 
215 

gag ttc gtg acc 
Glu Phe Val Thr 
230 

aag aag gga gac 
Lys Lys Gly Asp 
245 

ttg aca gaa cca 
Leu Thr Glu Pro 
260 

atg get gcg ctg 
Met Ala Ala Leu 



acc ggc aag 
Thr Gly Lys 



tac- ggc ctg 
Tyr Gly Leu 



gac ttc ttc 
Asp Phe Phe 
90 

ate ttc ttc 
He Phe Phe 
105 

ttc gag ggc 
Phe Glu Gly 
120 

ttc aag gag 
Phe Lys Glu 



aac age cac 
Asn Ser His 



aag gtg aac 
Lys Val Asn 
170 

etc gcc gac 
Leu Ala Asp 
185 

ctg ctg ccc 
Leu Leu Pro 
200 

gac ccc aac 
Asp Pro Asn 



gee gcc ggg 
Ala Ala Gly 



gaa gtg gac 
Glu Val Asp 
250 

cct cca gaa 
Pro Pro Glu 
2 65 

gag gca gag 
Glu Ala Glu 
280 



Ctg ccc gtg ccc 
Leu Pro Val Pro 
60 

cag tgc ttc gee 
Gin Cys Phe Ala 
75 

aag tec gcc atg 
Lys Ser Ala Met 



aag gac gac ggc 
Lys Asp Asp Gly 
110 

gac acc ctg gtg 
Asp Thr Leu Val 
125 

gac ggc aac ate 
Asp Gly Asn He 
140 

aac gtc tat ate 
Asn Val Tyr He 
155 

ttc aag ate cgc 
Phe Lys He Arg 



cac tac cag cag 
His Tyr Gin Gin 
190 

gac aac cac tac 
Asp Asn His Tyr 
205 

gag aag cgc gat 
Glu Lys Arg Asp 
220 

ate act etc ggc 
He Thr Leu Gly 
235 

gga gcc gac etc 
Gly Ala Asp Leu 



att gag gga gaa 
He Glu Gly Glu 
270 

ccc tat gat gac 
Pro Tyr Asp Asp 
285 



tgg ccc 192 
Trp Pro 



cgc tac 240 
Arg Tyr 
80 

ccc gaa 288 
Pro Glu 
95 

aac tac 336 
Asn Tyr 



aac cgc 3 84 
Asn Arg 



ctg ggg 432 
Leu Gly 



atg gcc 480 
Met Ala 
160- 

cac aac 528 

His Asn 

175 

aac ace 576 
Asn Thr 



ctg age 624 
Leu Ser 



cac atg 672 
His Met 



atg gac 720 
Met Asp 
240 

agt ctt 768 

Ser Leu 

255 

ata aag 816 
He Lys 



ate gtg 864 
He Val 
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gga gaa act gtg gag aaa act gag ttt att cct etc ctg gat ggt gat 912 
Gly Glu Thr Val Glu Lys Thr Glu Phe He Pro Leu Leu Asp Gly Asp 
290 295 300 

gag aaa acc ggg aac tea gag tec aaa aag aaa ccc tgc tta gac act 960 
Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr 
305 310 315 320 

age cag gtt gaa ggt ate cca tct tct aaa cca aca etc eta gee aat 1008 
Ser Gin Val Glu Gly He Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn 
325 330 335 

ggt gat cat gga atg gag ggg aat aac act gca ggg tct cca act gac 1056 
Gly Asp His Gly Met Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp 
340 345 350 

ttc ctt gaa gag aga gtg gac tat ccg gat tat cag age age eag aac 1104 
Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp Tyr Gin Ser Ser Gin Asn 
355 360 365 

tgg cca gaa gat gca age ttt tgt ttc cag cct cag caa gtg tta gat 1152 
Trp Pro Glu Asp Ala Ser Phe Cys Phe Gin Pro Gin Gin Val Leu Asp 
370 375 380 

act gac cag get gag ccc ttt aac gag cac cgt gat gat ggt ttg gca 1200 
Thr Asp Gin Ala Glu Pro Phe- Asn Glu His Arg Asp Asp Gly Leu Ala 
385 390 395 400 

gat ctg etc ttt gte tec agt gga ccc aeg aac get tct gca ttt aca 1248 
Asp Leu Leu Phe Val Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr 
405 410 415 

gag cga gac aat cct tea gaa gac agt tac ggt atg ctt ccc tgt gac 1296 
Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp 
X 420 425 430 

tea ttt get tec aeg get gtt gta tct eag gag tgg tct gtg gga gcc 1344 
Ser Phe Ala Ser Thr Ala Val Val Ser Gin Glu Trp Ser Val Gly Ala 
435 440 445 

cca aac tct cca tgt tea gag" tec tgt gtc tee cca gag gtt act ata 1392 
Pro Asn Ser Pro Cys Ser Glu Ser Cys Val Ser Pro Glu Val Thr He 
450 455 460 

gaa acc eta cag cca gca aca gag etc tec aag gca gca gaa gtg gaa 1440 
Glu Thr Leu Gin Pro Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu 
465 470 475 480 

tea gtg aaa gag cag ctg cca get aaa gca ttg gaa aeg atg gca gag 1488 
Ser Val Lys Glu Gin Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu 
485 490 495 

cag acc act gat gtg gtg cac tct cca tec aca gac aca aca cca ggc 1536 
Gin Thr Thr Asp Val Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly 
500 505 510 

cca gac aca gag gca gca ctg get aaa gac ata gaa gag ate acc aag 15 84 
Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp He Glu Glu He Thr Lys 
515 520 . 525 

cca gat gtg ata ttg gca aat gtc aeg cag cca tct act gaa teg gat 1632 



61 



wo 00/50872 



PCT/USOO/04794 



Pro Asp Val lie Leu Ala Asn Val Thr Gin Pro Ser Thr Glu Ser Asp 
530 535 540 

atg ttc ctg gcc cag gac atg gaa eta etc aca gga aca gag gca gcc 16B0 
Met Phe Leu Ala Gin Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala 
545 550 555 560 

cac get aac aat ate ata ttg cct aca gaa eca gac gaa tct tea acc 172 B 
His Ala Asn Asn lie He Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr 
565 570 575 

aag gat gta gca eca ect atg gaa gaa gaa att gtc eea ggc aat gat 1776 
Lys Asp Val Ala Pro Pro Met Glu Glu Glu He Val Pro Gly Asn Asp 
580 585 590 

aeg aca tee ccc aaa gaa aca gag aca aca ctt eca ata aaa atg gac 1824 
Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr Leu Pro He Lys Met Asp 
595 600 605 

ttg gca eca cct gag gat gtg tta ctt acc aaa gaa aca gaa eta gee 18 72 
Leu Ala Pro Pro Glu Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala 
610 615 620 

cea gcc aag ggc atg gtt tea etc tea gaa ata gaa gag get ctg gca 1920 
Pro Ala Lys Gly Met Val Ser Leu Ser Glu He Glu Glu Ala Leu Ala 
625 630 635 640 

aag aat gat gtt cgc tct gca gaa ata cct gtg get cag gag aca gtg 1968 
Lys Asn Asp Val Arg Ser Ala Glu He Pro Val Ala Gin Glu Thr Val 
645 650 655 

gtc tea gaa aca gag gtg gtc ctg gca aca gaa gtg gta ctg ccc tea 2016 
Val Ser Glu Thr Glu Val Val Leu Ala Thr Glu Val Val Leu Pro Ser 
660 665 670 

gat ccc ata aca aca ttg aca aag gat gtg aca etc ccc tta gaa gca 2064 
Asp Pro He Thr Thr Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala 
675 680 . 685 

gag aga ccg ttg gtg aeg gac atg act eca tct ctg gaa aca gaa atg 2112 
Glu Arg Pro Leu Val Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met 
690 695 700 

acc eta ggc aaa gag aca get eca ccc aca gaa aca aat ttg ggc atg 2160 
Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met 
705 710 715 720 

gcc aaa gac atg tct eca etc eca gaa tea gaa gtg act ctg ggc aag 2208 
Ala Lys Asp Met Ser Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys 
725 730 735 

gac gtg gtt ata ctt eca gaa aca aag gtg get gag ttt aac aat gtg 2256 
Asp Val Val He .Leu Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val 
740 745 750 . 

act eca ctt tea gaa gaa gag gta acc tea gtc aag gac atg tct ccg 23 04 
Thr Pro Leu Ser Glu Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro 
755 760 765 

tct gca gaa aca gag get ccc ctg get aag aat get gat ctg cac tea 2352 
Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser 
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770 775 780 

gga aca gag ctg att gtg gac aac age atg get cca gcc tec gat ctt 2400 
Gly Thr Glu Leu lie Val Asp Asn Ser Met Ala Pro Ala Ser Asp i#eu 
785 790 795 800 

gca ctg ccc ttg gaa aca aaa gta gca aca gtt cca att aaa gac aaa 2448 
Ala Leu Pro Leu Glu Thr Lys Val Ala Thr Val Pro lie Lys Asp Lys 
805 810 815 

gga atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate 2496 
Gly Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He 
620 825 830 

ctg gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec 2544 
Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser 
835 840 845 

ggc gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc 2592 
Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe 
650 655 860 

ate tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc 2640 
He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr 
865 870 875 880 

acc ctg ace cac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg 2688 
Thr Leu Thr His Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met 
685 890 895 

aag cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag 2736 
Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin 
900 905 910 

gag cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc 2784 
Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Afg Ala 
915 920 925 

gag gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag 2 832 
Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys 
930 935 940 

ggc ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag 2880 
Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu 
945 950 955 960 

tac aac ttc aac age cac aac gtc tat ate atg gcc gac aag cag aag 2928 
Tyr Asn Phe Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys 
965 970 975 

aac ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc 2976 
Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly 
980 985 990 

age gtg cag etc gcc gac cac tac cag cag aac ace ccc ate ggc gac 3 024 
Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp 
995 1000 1005 

ggc ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gcc 3072 
Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala 
1010 1015 1020 
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ctg age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag 312 0 
Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu 
1025 1030 1035 1040 

ttc gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag 316B 
Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
1045 1050 1055 



tag 



<210> 32 
<211> 1056 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
NLS-EYFP-MAPKDM-EBFP construct 

<400> 32 

Met Arg Pro Arg Arg Lys Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
1 5 10 15 

Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 
20 25 30 

Phie Ser. Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
35 40 45 

Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 
50 55 60 

Thr Leu Val Thr Thr Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr 
65. 70 75 80 

Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 
85 90 95 

Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr 
100 105 110 

Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 
115 120 125 

He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly 
130 135 140 

His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala 
145 150 155 160 

Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn 
165 170 175 

He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Th.r 
180 185 190 

Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 
195 200 205 



3171 
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Tyx Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Aep His Met 
210 215 220 

Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 
225 230 235 240 

Glu Leu Tyr Lys Lys Gly Asp GLu Val Asp Gly Ala Asp Leu Ser Leu 
245 250 255 

Val Asp Ala Leu Thr Glu Pro Pro Pro Glu He Glu Gly Glu He Lys 
260 265 270 

Arg Asp Phe Met Ala Ala Leu Glu Ala Glu Pro Tyr Asp Asp He Val 
275 280 285 

Gly Glu Thr Val Glu Lys Thr Glu Phe He Pro Leu Leu Asp Gly Asp 
290 295 300 

Glu Lys Thr Gly Asrt Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr 
305 310 315 320 

Ser Gin Val Glu Gly He Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn 
325 330 335 

Gly Asp His Gly Met Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp 
340 345 350 

Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp Tyr Gin Ser Ser Gin Asn 
355 360 365 

Trp Pro Glu Asp Ala Ser Phe Cys Phe Gin Pro Gin Gin Val Leu Asp 
370 375 380 

Thr Asp Gin Ala Glu Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala 
385 390 395 400 

Asp Leu Leu Phe Val Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr 
405 410 415 

Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp 
420 425 430 

Ser Phe Ala Ser Thr Ala Val Val Ser Gin Glu Trp Ser Val Gly Ala 
435 440 445 

Pro Asn Ser Pro Cys Ser Glu Ser Cys Val Ser Pro Glu Val Thr He 
450 455 460 

Glu Thr Leu Gin Pro Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu 
465 470 475 480 

Ser Val Lys Glu Gin Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu 
. 485 490 495 

Gin Thr Thr Asp Val Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly 
500 505 510 

Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp He Glu Glu He Thr Lys 
515 520 525 

Pro Asp Val He Leu Ala Asn Val Thr Gin Pro Ser Thr Glu Ser Asp 
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530 535 540 

Met Phe Leu Ala Gin Asp Met Glu Leu Leu Thr Gly Thr Glu Ala Ala 
545 550 555 560 

His Ala Asn Asn He He Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr 
56S 570 575' 

Lys Asp Val Ala Pro Pro Met Glu Glu Glu He Val Pro Gly Asn Asp 
580 585 590 

Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr Leu Pro He Lys Met Asp 
595 600 605 

Leu Ala Pro Pro Glu Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala 
610 615 620 

Pro Ala Lys Gly Met Val Ser Leu Ser Glu He Glu Glu Ala Leu Ala 
625 630 635 640 

Lys Asn Asp Val Arg Ser Ala Glu He Pro Val Ala Gin Glu Thr Val 
645 650 655 

Val Ser Glu Thr Glu Val Val Leu Ala Thr Glu Val Val Leu Pro Ser 
660 665 670 

Asp Pro He Thr Thr Leu Thr "Lys Asp Val Thr Leu Pro Leu Glu Ala 
675 680 - 685 

Glu Arg Pro Leu Val Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met 
690 695 700 

Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met 
705 710 715 720 

Ala Lys Asp- Met Ser Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys 
725 730 735 

Asp Val Val He Leu Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val 
740 745 750 

Thr Pro Leu Ser Glu Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro 
755 760 765 

Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser 
770 775 780 

Gly Thr Glu Leu He Val Asp Asn Ser Met T^a Pro Ala Ser Asp Ijeu 
785 790 795 800 

Ala Leu Pro Leu Glu Thr Lys Val Ala Thr Val Pro He Lys Asp Lye 
805 810 815 

Gly Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He 
820 825 830 

Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser 
835 840 845 

Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe 
aso 855 860 
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He Cys Thr Tlir Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val TJar 
865 870 875 880 

Thr Leu Thr His Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met 
885 890 895 

Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin 
900 905 910 

Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala 
915 920 925 

Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys 
930 935 940 

Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu 
945 950 955 960 

Tyr Asn Phe Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys 
965 970 975 

Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly 
980 985 990 

Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp 
995 1000 1005 

Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala 
1010 1015 1020 

Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu 
1025 1030 1035 1040 

phe Val Thr AJ-a l^a Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
1045 1050 1055 



<210> 33 
<211> 1623 
<212> DHA 

<213> Artificial Sequence 



<220> 

<221> CDS 

<222> (1).,(1623) 



<220> 

<223> Description of Artificial Sec[uence: 

YPP-NLS-CP3 -multiple DEVD-CFP-Annexin II construct 



<400> 33 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

1 5 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag tte ate 144 
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Glu Gly Glu Gly Asp Ala Thr Tyx Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ttc ggc tac ggc ctg cag tgc ttc gcc cgc tac ccc gac cac atg aag 240 

Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lye 
65 70 75 80 

cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 2BB 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 



gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 



gac gca ggt agt act atg gtg age aag ggc gag gag ctg ttc acc ggg 
Asp Ala Gly Ser Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 



68 



384 



ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gcc gac aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lye Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gcc gac ><:ac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
ISO 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age tac cag tec gcc ctg 624 
pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag tec 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

gga aga agg aaa ega caa aag cga teg gca ggt gac gaa gtt gat gca 7 68 
Gly Arg Arg Lys Arg Gin Lys Arg Ser Ala Gly Asp Glu Val Asp Ala 
245 250 255 

ggt gac gaa gtt gat gca gcft gac gaa gtt gat gca ggt gac gaa gtt 816 
Gly Asp Glu Val Asp Ala Gly Asp Glu Val Asp Ala Gly Asp Glu Val 
260 265 270 
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275 280 285 

gtg gtg ccc ate ctg gtc gag ctg gac ggc gac gta aac ggc cac aag 912 
Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly Hie Lya 
290 29S 300 

ttc age gtg tec ggc gag ggc gag ggc gat gcc acc tac ggc aag ctg 960 
Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
305 310 315 320 

acc ctg aag ttc ate tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc 1008 
Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 
325 330 335 

acc etc gtg acc acc ctg acc tgg ggc gtg cag tgc ttc age cgc tac 1056 
Thr Leu Val Thr Thr Leu Thr Trp Gly Val Gin cys Phe Ser Arg Tyr 
340 345 350 

ccc gac cac atg aag cag cac gac ttc ttc aag tec gcc atg ccc gaa 1104 
Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 
355 360 365 

ggc tac gtc cag gag cgc acc ate ttc ttc aag gac gac ggc aac tac . 1152 
Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr 
370 375 380 

aag acc cgc gcc gag gtg aag ttc gag ggc gac acc ctg gtg aac cgc 1200 
Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 
385 390 395 400 

ate gag ctg aag ggc ate gac ttc aag gag gac ggc aac ate ctg ggg 1248 
He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly 
405 410 415 

cac aag ctg gag tac aac tac ate age cac aac gtc tat ate acc gcc 1296 
His Lys Leu Glu Tyr Asii Tyr He Ser His Asn Val '1?yr He Thr Ala 
420 425 430 

gac aag cag aag aac ggc ate aag gcc aac ttc aag ate cgc cac aac 1344 
Asp Lys Gin Lys Asn Gly He Lys Ala Asn Phe Lys He Arg His Asn 
435 440 445 

ate gag gac ggc age gtg cag etc gcc gac cac tac cag cag aac acc 13 92 
He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr • 
450 455 460 

ccc ate ggc gac ggc ccc gtg ctg ctg ccc gac aac cac tac ctg age 1440 
Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 
465 470 475 480 

acc cag tec gee ctg age aaa gac ccc aac gag aag cgc gat cac atg 14 88 
Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
485 490 495 

gtc ctg ctg gag ttc gtg acc gcc gcc ggg ate act etc ggc atg gac 1536 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 
500 505 510 

gag ctg tac aag atg tct act gtc cac gaa ate ctg tgc aag etc age 1584 
Glu Leu Tyr Lys Met Ser Thr Val His Glu He Leu Cys Lys Leu Ser 
515 520 525 
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ttg gag ggt gtt cat tct aca ccc cca agt gcc gga tec 
Leu Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser 
530 535 540 



<210> 34 

<211> 541 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

YFP-NIiS-CP3 -multiple DEVD-CFP-Aimexin II construct 

<400> 34 

Met ' Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60. 

Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 / 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asi:i Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 
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Gly Arg Arg Lys Arg Gin Lys Arg Ser Ala Gly Asp Glu Val Aep Ala 
245 250 255 

Gly Asp Glu Val Asp Ala Gly Asp Glu Val Asp Ala Gly Asp Glu Val 
260 265 270 

Asp Ala Gly Ser T3ar Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
275 2B0 285 

Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 
290 295 300 

Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
305 310 315 320 

Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 
325 330 335 

Thr Leu Val Thr Thr Leu Tlir Trp Gly Val Gin Cys Phe Ser Arg Tyr 
340 345 350 

Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 
355 360 365 

Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr 
370 375 380 . 

Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 
385 390 395 400 

He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly 
405 410 415 

His Lys Leu Glu Tyr Asn Tyr He Ser His Asn Val Tyr He Thr Ala 
420 425 430 

Asp Lys Gin Lys Asn Gly He Lys Ala Asn Phe Lys He Arg His Asn 
435 440 . 445 

He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 
450 455 460 . 

Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 
465 470 475 480 

Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
485 490 495 

Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 
500 505 510 

Glu Leu Tyr Lys Met Ser Thr Val His Glu He Leu Cys Lys Leu Ser 
515 520 525 

Leu Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser 
530 535 540 



<210> 35 
<211> 24 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FIjAG epitope 
<400> 35 

gactacaaag acgacgacga caaa 24 

<210> 36 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; FLAG epitope 
<400> 36 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 



<210> 37 
<211> 27 
<212> DKTA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HA epitope 
<400> 37 

tacccatacg acgtaccaga ctacgca 27 
<210> 38 

<211> 9 / 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HA epitope 
<400> 38 

Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
1 5 



<210> 39 
<211> IB 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: KT3 epitope 
<400> 39 

ccaccagaac cagaaaca 18 



<210> 40 
<211> 6 • 
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<212> PRT 

<213> Artificial Seguezice 
<220> 

<223> Description of Artificial Sequence: KT3 epitope 
<400> 40 

Pro Pro Glu Pro Glu Thr 
1 5 



<210> 41 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Myc epitope 
<400> 41 

gcagaagaac aaaaattaat aagcgaagaa gactta 36 



<210> 42 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Myc epitope 
<400> 42 

Ala Glu Glu Gin Lys Leu lie Ser Glu Glu Asp Leu 
15 10 



<210> 43 
<211> 717 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1), .(717) 

<220> 

<223> Description of Artificial Sequence: EYFP 



<400> 43 

atg gtg age aag ggc gag gag ctg ttc 
Met Val Ser Lys Gly Glu Glu Leu Phe 
1 5 



acc ggg gtg gtg ccc ate ctg 4 8 
Thr Gly Val Val Pro lie Leu 
10 15 



gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 -30 

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc ate 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 
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tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Tlir Thr 
50 55 60 

ttc ggc tac ggc ctg cag tgc ttc gcc cgc tac ccc gac cac atg aag 240 
Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 . 75 BO 

cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 2B8 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 33 6 
Arg Thr Xle Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gcc gac aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn lie Glu Asp Gly Ser- 
ies 170 175 

gtg cag etc gcc gac cac tac cag cag aac ace ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC gtg ctg ctg ccc gac aac cac tac ctg age tac cag tec gcc ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag 717 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 44 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: EYFP 
<400> 44 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 
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Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp Hi^ Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 lio 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

lie Asp Phe Lys Glu Asp Gly Ash lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly lie Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Oln Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Ash His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 / 205 ^ 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 22 b 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 

230 235 



225 




<210> 


45 


<211> 


717 


<212> 


DNA 


<213> 


Artificial I 


<220> 




<221> 


CDS 


<222> 


(1) . . (717) 


<220> 




<223> 


Description 


<4O0> 


45 



atg gtg age aag ggc gag gag ctg ttc acc. ggg gtg gtg ccc ate ctg 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
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Val Glu Leu Asp Gly Asp Val Asn Gly His Iiys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 



tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thir 
50 55 60 



192 



ctg acc tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 . 80 



336 



cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 12 0 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gcc gac aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac^atc gag gac gg9 age 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 17Q 175- 

gtg cag etc gcc gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 ISO 



528 



672 



ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gcc ctg 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gee ggg ate act etc ggc atg gac gag ctg tac aag 717 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 46 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: EGFP 
<400> 46 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Aen Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 ~ 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly ^er 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asi> Thr Pro He Gly Asp Gly 
180 1B5 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 2D0 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His. Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 47 

<211> 717 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (717) 

<220> 

<:223> Description of Artificial Sequence: EBFP 
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<400> 47 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

1 5 10 IS 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 95 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

gag ggc gag ggc gat gee acc tae ggc aag ctg aec etg aag ttc ate 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

tge acc acc ggc aag ctg ccc gtg cec tgg ccc acc etc gtg acc acc 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

ctg acc cac ggc gtg cag tge ttc age cgc tac ccc gac cac atg aag 24 0 

Leu Thr His Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

65 70 75 80 



cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



CCC gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gee ctg 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 



268 



cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gee gag 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr ^ 
130 135 140 

aac ttc aac age cac aac gtc tat ate atg gcc gac aag cag aag aac 480 
Asn Phe Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 ISO 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gee gac cac tac cag cag aac ace ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 



624 



age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag 717 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 
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<210> 48 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: EBPP 
<400> 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asa Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
- 35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr His Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
13 0 135 140 

Asn Phe Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 ISO 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 . 235 



<210> 49 
<211> 717 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<221> CDS 

<222> (1) . . (717) 

<220> 

<223> Description of Artificial Sequence: ECFP 
<400> 49 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 4 8 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie lieu 
1 5 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu lieu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 



gag ggc gag ggc gat gee acc tac ggc aag ctg acc ctg aag ttc ate 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 



ctg acc tgg ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 

Leu Thr Trp Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

€5 - 70 75 80 

cag cac gac ttc ttc aag tec gee atg ccc gaa ggc tac gtc cag gag 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 



144 



tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg ace ace 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



240 



288 



cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gee gag 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyx Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac ate age cac aac gtc tat ate acc gee gac aag cag aag aac 4 80 
Asn Tyr He Ser His Asn Val Tyr He Thr Ala Asp Lys Gin Lys Asn 
. 145 150 155 160 

ggc ate aag gee aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly He Lys Ala Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gee gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gee ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 
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age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag 717 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 50 




<211> 239 




<212> PRT 




<213> Artificial Sequence 




<220> 




<223> Description of Artificial 


Sequence : ECFP 


<:400> 50 




Met Val Ser Lys Gly Glu Glu Leu 


Phe Tlir Gly Val 


X 5 


10 



15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr 'Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Trp Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr He Ser His Asn Val Tyr He Thr Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Ala Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys ^Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
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225 



230 



235 



<210> 51 

<211> 720 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) , . (717) 



<220> 

<223> Description of Artificial Sequence: Fred25 
<400> 51 

atg get age aaa gga gaa gaa etc ttc act gga gtt gtc cca att ctt 
Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 

gtt gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggt gaa ggt gat gca aca tac gga aaa ctt acc ctg aag ttc ate 
GluJSly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc act act ggc aaa ctg ect gtt cca tgg eca aca eta gtc act act 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 ^0 

ctg tgc tat ggt gtt caa tgc ttt tea aga tac ceg gat cat atg aaa 
Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

egg cat gac ttt ttc aag agt gee atg ecc gaa ggt tat gta cag gaa 
Arg His Asp Phe Phe Lys Ser Ala Met Pr9 Glu Gly Tyr Val Gin Glu 
85 90 95 

agg acc ate ttc ttc aaa gat gac ggc aac tac aag aca cgt get gaa 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtc aag ttt gaa gigt gat acc ctt gtt aat aga ate gag tta aaa ggt 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
X15 120 125 

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tat aac tea cac aat gta tac ate atg gca gac aaa caa aag aat 
Asn Tyr Asn Ser His Asn. Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

gga ate aaa gtg aac ttc aag ace cgc cac aac att gaa gat gga age 
Gly lie Lys Val Asn Phe Lys Thr Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

gtt caa eta gca gac cat tat caa caa aat act cca att ggc gat ggc 



48 



9i5 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 
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Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly. Asp Gly 
180 185 190 

cct gtc ctt tta cca gac aac cat tac ctg tec aca caa tct gcc ctt 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

teg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Ph.e 
210 215 220 

gta aca get get ggg att aca eat gge atg gat gaa ctg tac aac tag 720 
Val Thr* Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Asn 
225 230 235 



<210> 52 

<211> 239 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Fred25 

<400> 52 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1^3 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 
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Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn 
225 230 235 



<210> 53 
<211> 14 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial. Sequence: Caspase-1, 4, 5 
substrate recognition sequence 

<400> 53 
tgggaacatg acaa 



<210> 54 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-l, 4, 5 
substrate recognition secpience 

<400> 54 
Trp Glu His Asp 
1 



<210> 55 . 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-1 
STibstrate recognition sequence 

<400> 55 
tggtttaaag ac 



<210> 56 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-1 
substrate recognition sequence 

<400> 56 

Trp Phe Lys Asp 
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1 



<210> 57 

<211> 12 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: CaBpase-2 
Bubstrate recognition sequence 

<400> 57 

gacgaacacg ac 12 



<210> 5-8 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of . Artificial Sequence: Caspase-2 
substrate recognition sequence 

<400> 58 
Asp Glu His Asp 
1 



<210> 59 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-3 , 7 
substrate recognition sequence 

<400> 59 

gacgaagttg ac 12 



<210> 60 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-3, 7 
substrate recognition sequence 

<400> 60 
Asp Glu Val Asp 
1 



<210> 61 

<211> 12 

<212> DKTA 

<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: proCaspase-S 
substrate recognition sequence 

<400> 61 
atagaaacag ac 



<210> 62 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-3 
substrate recognition sequence 

<400> 62 
lie Glu Thr Asp 
1 



<210> 63 

<211> 12 

<212> DWA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: proCaspase-4 , 5 
substrate recognition sequence 

<400> 63 
tgggtaagag ac 



<210> 64 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-4 , 5 
substrate recognition sequence 



<400> 64 
Trp Val Arg Asp 
1 



<210> 65 
<211> 12 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: CaBpase-6 
s\ibstrate recognition sequence 

<400> 65 

gtagaaatag ac -^^ 
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<210> 66 
<:211> 4 
<:212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
sTibstrate recognition sequence 

<400> 66 
Val Glu lie Asp 
1 



<210> 67 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition seofuence 

<400> 67 

gtagaacacg ac 12 

<210> 68 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 

<400> 68 
Val Glu His Asp 
1 



<210> 69 

<211> 12 , 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-6 
substrate recognition sequence 

<400> 69 

acagaagtag ac 12 

<210> 70 
<211> 4 
<212> PRT 

<213> Artificial Sequence 



87 



wo 00/50872 



PCT/USOO/04794 



<220> 

<223> Description of Artificial Sequence: proCaspase-6 
substrate recognition sequence 

<400> 70 
Thr Glu Val Asp 
1 



<210> 71 

<211> 12 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: proCaspase-7 
substrate recognition sequence 

<400> 71 

atacaagcag ac 12 



<210> 72 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-? 
substrate recognition sequence 



<400> 72 
lie Gin Ala Asp 
1 



<210> 73 
<211> 12 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Caspase-S 
substrate recognition sequence 

<400> 73 
gtagaaacag ac 



<210> 74 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-8 
siabstrate recognition sequence 



<400> 74 
Val Glu Thr Asp 
1 
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<210> 75 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspaee-S 
substrate recognition sequence 

<400> 75 
ttagaaacag ac 



<210> 76 
<211> 4 
<212> PRT 

<213> Artificial Sequence 

i 

<220> 

<223> Description of Artificial Sequence: proCaspase-8 
substrate recognition sequence 

<400> 76 
Leu Glu Thr Asp 
1 



<210> 77 

<211> 12 . 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-S 
substrate recognition secpaence 

<400> 77 
ttagaacacg ac 



<210> 78 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-9 
substrate recognition secpience 

<400> 78 
Leu Glu His Asp 
1 



<210> 79 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: proCaspase-9 
substrate recognition sequence 

<400> 79 

ttagaacacg ac 12 



<210> BO 
<211> 4 
<212> PRT 

<213> Artificial Secpience 
<220> 

<223> Description of Artificial Sequence: proCaspase-9 
siibstrate recognition sequence 

<400> BO 

Leu Glu His Asp 

^ i. i 

<210> 81 
<211> 12 
<212> DMA 

<213> Artificial Sequence 
<220^ 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 81 

agccaaaatt ac 12 



<210> 82 
<211> 4 

<212> PRT , \ 

<213> Artificial Sequence ' ' 

<220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 82 
Ser Gin Asn Tyr 
1 



<210> 83 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 83 
ccaatagtac aa 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-2 
substrate recognition secjuence 

<400> 57 
gacgaacacg ac 



<210> SB 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-2 
substrate recognition sequence 

<400> 58 

Asp Glu His Asp 
1 



<210> 59 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-3,7 
substrate recognition sequence 

<400> 59 
gacgaagttg ac 



<210> 60 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-3,7 
substrate recognition sequence 

<400> 60 

Asp Glu Val Asp 
1 



<210> 61 
<211> 12 
<212> DKTA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: proCaspase-3 
substrate recognition sequence 

<4O0> 61 
atagaaacag ac 



<210> 62 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-3 
substrate recognition secpience 

<400> 62 

He Glu Thr Asp i ' 

1 



<210> 63 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of T^tificial Sequence: proCaBpase-4 , 5 
sxibstrate recognition sequence 

<400> 63 
tgggtaagag ac 



<210> 64 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pro.Caspase-4 , 5 
substrate recognition sequence 

<400> 64 
Trp Val Arg Asp 
1 



<:210> 65 

<211> 12 • 
.<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 
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<400> 65 
gtagaaatag ac 



<210> 66 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 

<400> 66 
Val Glu lie Asp 
1 



<210> 67 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 

<400> 67 
gtagaacacg ac 

<210> 68 

<211> 4 

<212> PRT / 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 

<400> 68 
Val Glu His Asp 
1 



<210> 69 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase 
substrate recognition secjuence 

<400> 69 
acagaagtag ac 
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<210> 70 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-6 
substrate recognition sequence 

<400> 70 
Thr Glu Val Asp 
1 



<210> 71 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-7 
substrate recognition sequence 

<400> 71 

atacaagcag 'ac ' 12 



<210> 72 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of ^Artif icial Sequence: proGaspase-7 
substrate recognition sequence , 

<:400> 72 
He Gin Ala Asp 
1 



<210> 73 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-8 
substrate recognition sequence 

<400> 73 

gtagaaacag ac 12 



<210> 74 
<211> 4 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secjuence : Caspase-8 
substrate recognition sequence 

<400> 74 
Val Glu Thr Asp 
1 



<210> 75 

<211> 12 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Seqpaence: proCaspase-B 
substrate recognition sequence 

<400> 75 

ttagaaacag ac 12 

<210> 76 
<211> 4 

<212> PRT / 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sec[uence: proCaspase-8 
substrate recognition sequence 

''<400> 76 

Leu Glu Thr Asp . 
1 



<210> 77 
<211> 12 
<212> DUA 

<213> Artificial Sequence 
<220> 

<:223> Description of Artificial Sequence: Caspase-S 
substrate recognition sequence 

<400> 77 

ttagaacacg ac 12 



<210> 78 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Caspase-S 
substrate recognition sequence 

<400> 78 

Leu Glu His Asp 
1 



<210> 79 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspaBe-9 
substrate recognition sequence 

<400> 79 

ttagaacacg ac 12 



<210> BO 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-9 
substrate recognition sequence 

<400> 80 

Leu Glu His Asp 

1 , 



<210> 81 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; HIV protease 
substrate recognition sequence 

<400> 81 

agccaaaatt ac 12 



<210> 82 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 
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<400> 82 
Ser Gin Asn Tyr 
1 



<210> 83 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HTV protease 
siibstrate recognition sec[uence 

<400> 83 
ccaatagtac aa 



<210> 84 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
c220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 84 
Pro lie Val Gin 
1 



<210> ^5 , 
<2X1> 12 
<212> DNA 

<213> Artificial Sequence 
<22 0> 

<223> Description of Artificial Seqpience: Adenovirus 
endopeptidase substrate recognition sequence 

<400> 85 
atgtttggag ga 



<210> 86 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secpience: Adenovirus 
endopeptidase substrate recognition sequence 

<40O> 86 

Met Phe Gly Gly 
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1 



<210> 87 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Adenovirus 
endopeptidase substrate recognition sequence 

<400> 87 
gcaaaaaaaa ga 



<210> 88 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Adenovirus 
endopeptidase substrate recognition sequence 

<400> 88 
Ala Lys Lys Arg 
1 



<210> 89 
<211> 9 
<212> DNA 

<213> Artific^^al Sequence 
<220> 

<223> Description of Artificial Sequence: b-Secretase 
substrate recognition sequence 

<400> 89 
gtgaaaatg 



<210> 90 

<211> 3 

<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Secpience: b-Secretase 
substrate recognition sequence 

<400> 90 
Val Lys Met 
1 
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<210> 9X 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: b-Secretase 
sxibstrate recognition sequence 

<400> 91 
gacgcagaat t c 



<210> 92 
<211> 4 
<212> PRT 

<:213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: b-Secretase 
substrate recognition sequence 

<400> 92 
Asp Ala Glu Phe 
1 



<210> 93 
<211> 15 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cathepsin D 
substrate recognition sequence 

<400> 93 

aaaccagcat tattc 



<210> 94 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cathepsin D 
substrate recognition sequence 

<400> 94 

Lys Pro Ala Leu Phe 
15 



<210> 95 
<211> 9 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cathepsln D 
substrate recognition sequence 

<400> 95 

ttcagatta 9 



<210> 96 
<211> 3 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cathepsin D 
substrate recognition sequence 

<400> 96 
Phe Arg Leu 
1 



<210> 97 
<211> 15 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Matrix 

Metalloprotease substrate recognition sequence 

<400> 97 ^ ^ 

ggaccattag gacca - 15 



<210> 9B 
<211> 5 
<212> PRT 

<213>. Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Matrix 

Metalloprotease substrate recognition sequence 

<400> 98 

Gly Pro Leu Gly Pro 
1 5 



<210> 99 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: Granzyme B 
substrate recognition sequence 

<400> 99 

atagaaccag ac 12 



<210> 100 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Granzyme B 
substrate recognition sequence 

<400> 100 
lie Glu Pro Asp 
1 



<210> 101 
<211> 36 
<212> DNA 

<213> Artificial Segixence 
<220> 

<223> Description of Artificial Sequence: Anthrax 
protease substrate recognition sequence 

<400> 101 

atgcccaaga agaagccgac gcccatccag ctgaac 36 
<210> 102 

<211> 12 J \ 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Anthrax 
protease substrate recognition sequence 

<400> 102 

Met Pro Iiys Lys Lys Pro Thr Pro lie Gin Leu Asn 
15 10 



<210> 103 

<211> 45 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Anthrax 
protease substrate recognition sequence 
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<400> 103 

atgctggccc ggaggaagcc ggtgctgccg gcgctcacca tcaac 45 



<210> 104 
<2.11> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Antlirax 
protease substrate recognition sequence 

<400> 104 

Met Leu Ala Arg Arg Lys Pro Val Leu Pro Ala Leu Thr lie Asn 
15 10 15 



<210> 105 
<:211> IB 
<212> DKTA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

tetanus/botulium sxabstrate recognition sequence 

<400> 105 

gcctcgcagt ttgaaaca 



<210> 106 
<211> 6 

<212> PRT / ^ 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sec[uence: 

tetanus/botulium siibstrate recognition sequence 



<400> 106 

Ala Ser Gin Phe Glu Thr 
1 5 



<210> 107 
<211> 18 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: 

tetanus/botulium substrate recognition sequence 

<400> 107 

gcttctcaat ttgaaacg 
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<210> 108 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence! 

tetanus /botulium substrate recognition sequence 

<400> 108 

Ala Ser Gin Phe Glu Thr 
1 5 



<210> 109 
<211> IB 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin A substrate recognition sequence 

<400> 109 

gccaaccaac gtgcaaca 



<210> 110 
<211> 6 
<212> PRT 

<213> Artificial Sequence 

<220> ^ 
<223> Description of Artificial Sequence: Botulinum 
neurotoxin A substrate recognition sequence 

<400> 110 

Ala Asn Gin Arg Ala Thr 
1 5 



<210> 111 
<211> IB 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulxnum 
neurotoxin B substrate recognition sequence 

<400> 111 

gcttctcaat ttgaaacg 



<210> 112 
<211> 6 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin B substrate recognition sequence 

<400> 112 

Ala Ser Gin Phe Glu Thr 
1 5 



<21Q> 113 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin C substrate recognition sec[uence 

<400> 113 

acgaaaaaag ctgtgaaa 



<210> 114 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin C substrate recognition sequence 

<400> 114 

Thr Lys Lys Ala Val Lys 
1 5 



<210> 115 
<211> IB 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin D substrate recognition sequence 

<400> 115 

gaccagaagc tctctgag 



<210> 116 . 
<211> S 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin D substrate recognition sequence 

<400> 116 

Asp Gin Lys Leu Ser Glu 
1 5 



<210>. 117 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<:220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin E substrate recognition sec[uence 

<400> 117 

atcgacagga tcatggag 



<210> 118 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin E substrate recognition secjuence 

<:400> 118 

lie Asp Arg lie Met Glu 

^1 5 . / 



<210> 119 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin F substrate recognition sequence 

<400> 119 

agagaccaga agctctct 

<210> 120 

<211> 6 

<212> PRT 

<;213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin F substrate recognition sequence 
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<400> 120 

Arg Asp Gin Lys Leu Ser 
1 5 



<210> 121 
<211> 18 
<212> DNA 

<212> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin G substrate recognition sequence 

<400> 121 

acgagcgcag ccaagttg 



<210> 122 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin G substrate recognition sequence 



<400> 122 

Thr Ser Ala Ala Lys Leu 
1 5 



<210> 173 / 
<211> 69 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

Cytoplasm/ cytoskeleton target sequence 

<400> 123 

atgtctactg tccacgaaat cctgtgcaag ctcagcttgg agggtgttca ttctacaccc 60 
ccaagtgcc 



<21*0> 124 

<211> 23 

<212> PRT 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: 

Cytoplasm/ cytoskeleton target sequence 
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<400> 124 

Met Ser Thr Val His Glu lie Leu Cys Lys Leu Ser Leu Glu Gly Val 
1 5 10 15 

His Ser Thr Pro Pro Ser Ala 
20 



<210> 125 
<211> 96 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Inner surface 
of plasma membrane target sequence 

<400> 125 

atgggatgta cattaagcgc agaagacaaa gcagcagtag aaagaagcaa aatgatagac 60 
agaaacttaa gagaagacgg agaaaaagct gctaga 96 



<210> 126 
<211> 32 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Inner surface 
of plasma membrane target secjuence 

<400> 126 

Met Gly Cys Thr Leu Ser Ala Glu Asp Lys Ala Ala^Val Glu Arg Ser 
1 5 10 15 

Lys Met lie Asp Arg Asn Leu Arg Glu Asp Gly Glu Lys Ala Ala Arg 
20 25 30 



<:210> 127 
<211> 18 

<212> DNA 1 \ 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Nucleus target 
sequence 

<400> 127 

agaaggaaac gacaaaag 



<210> 128 
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<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nucleus target 
sequence 

<400> 128 

Arg Arg Isya Arg Gin Lys 
1 5 



<210> 129 
<211> 90 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nucleolus 
target sequence 

<400> 129 

agaaaacgta tacgtactta cctcaagtcc tgcaggcgga tgaaaagaag tggttttgag 60 
atgtctcgac ctattccttc ccaccttact 50 



<210> 130 
<211> 30 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
target sequence 

<400> 130 

Arg Lys Arg lie Arg Thr Tyr Leu 
1 5 

Ser Gly Phe Glu Met Ser Arg Pro 
20 



Sequence : Nucleolus 



Lys Ser Cys Arg Arg Met Lys Arg 
10 15 

lie Pro Ser His Leu Thr 
25 30 



<210> 131 
<211> 87 
<212> DlsTA 
<213> Artificial 



Sequence 



<220> . ' »; 

<223> Description of Artificial Secjuence: Mitochondria 
target sequence 

<400> 131 

atgtccgtcc tgacgccgct gctgctgcgg ggcttgacag gctcggcccg gcggctccca 60 
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gtgccgcgcg ccaagatcca ttcgttg 



<210> 132 
<211> 29 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
target sequence 

<4:00> 132 

Met Ser Val Leu Thr Pro Leu Leu 
1 5 

Arg Arg Leu Pro Val Pro Arg Ala 
20 



Sequence : Mitochondria 

Leu Arg Gly Leu Thr Gly Ser Ala 
10 15 

Leu lie His Ser Leu 
25 



<210> 133 
<211> 99 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secjuence: Nuclear 
Envelope target sequence 

<400> 133 

atgagcattg ttttaataat tgttattgtg gtgatttttt taatatgttt tttatattta 60 
agcaacagca aagatcccag agtaccagtt gaattaatg 99 



<210> 134 
<211> 33 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear 
Envelope target sequence 

<400> 134 

Met Ser He Val Leu He He Val He Val Val He Phe Leu He Cys 
1 5 10 15 

Phe Leu Tyr Leu Ser Asn Ser Lys Asp Pro Arg Val Pro Val Glu Leu 
20 25 30 

Met 



<210> 135 
<211> 246 
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<212> DKA. 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Golgi target 
sequence 

<400> 135 

atgaggcttc gggagccgct cctgagcggc agcgccgcga tgccaggcgc gtccctacag 60 
cgggcctgcc gcctgctcgt ggccgtctgc gctctgcacc ttggcgtcac cctcgtttac 120 
tacctggctg gccgcgacct gagccgcctg ccccaactgg tcggagtctc cacaccgctg 180 
cagggcggct cgaacagtgc cgccgccatc gggcagtcct ccggggagct ccggaccgga 240 

egggcc 246 



<210> 136 
<211> 82 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Golgi target 
sequence 

<400> 136 

Met Pirg Leu Arg Glu Pro Leu Leu Ser Gly Ser Ala Ala Met Pro Gly 
1 5 10 15 

Ala Ser Leu Gin Arg Ala Cya Arg Leu Leu Val Ala Val Cys Ala Leu 
20 25 30 

His Leu Gly Val Thr Leu Val Tyr Tyr Leu Ala Gly Arg Asp Leu Ser 
35 40 45 

Arg Leu Pro Gin Leu Val Gly Val Ser Thr Pro Leu Gin Gly Gly Ser 
50 55 60 

Asn Ser Ala Ala Ala lie Gly Gin Ser Ser Gly Glu Leu Arg Tlir Gly 
65 70 75 80 

Gly Ala 



<210> 137 
<211> 150 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Endoplasmic 
reticulum target sequence 
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<400> 137 

gaaacaataa gacctataag aataagaaga tgttcttatt ttacatctac agacagcaaa 60 
atggcaattc aattaagatc tccctttcca ttagcattac caggaatgtt agctttatta 120 
ggatggtggt ggtttttcag tagaaaaaaa . 150 



<210> 138 
<211> 50 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Endoplasmic 
reticulum target sequence 

<400> 138 

Glu Thr He Arg Pro He Arg He Arg Arg Cys Ser Tyr Phe Thr Ser 
15 10 15 

Thr Asp Ser Lys Met Ala He Gin Leu Arg Ser Pro Phe Pro Leu Ala 
20 25 30 

Leu Pro Gly Met Leu Ala Leu Leu Gly Trp Trp Trp Phe Phe Ser Arg 
35 40 45 

Lys Lys 
50 



<210> 139 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear Export 
target sequence 

<400> 139 

gccttgcaga. agaagctgga ggagctagag cttgatgag 39 



<210> 140 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear Export 
target seq[uence 

<400> 140 

Ala Leu Gin Lys Lys Leu Glu Glu Leu Glu Leu Asp Glu 
15 10 
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<210> 141 
<211> 1024 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Size exclusion 
target sequence 



<400> 141 
gccgacctca 


gtcttgtgga 


tgcgttgaca 


gaaccacctc 


cagaaattga 


gggagaaata 


60 


aagcgagact 


tcatggctgc 


gctggaggca 


gagccctatg 


atgacatcgt 




120 


gtggagaaaa 


ctgagtttat 


tcctctcctg gatggtgatg agaaaaccgg 


gaactcagag 


180 


tccaaaaaga 


aaccctgctt 


agacactagc 


caggttgaag 


gtatcccatc 


ttctaaacca 


240 


acactcctag 


ccaatggtga 


tcatggaatg 


gaggSQS-sta 




gtctccaact 


300 


gacttccttg 


aagagagagt 


ggactatccg 


gattatcaga 


gcagccagaa 


ctggccagaa 


360 


gatgcaagct 


tttgtttcca 


gcctcagcaa 


gtgttagata 


ctgaccaggc 


tgagcccttt 


420 


aacgagcacc 


gtgatgatgg 


tttggcagat 


ctgctctttg 


tctccagtgg 


acccacgaac 


480 


gcttctgcat 


ttacagagcg 


agacaatcct 


tcagaagaca 


gttacggtat 


gcttccctgt 


540 


gactcatttg 


cttccacggc 


tgttgtatct 


caggagtggt 


ctgtgggagc 


cccaaactct 


600 


ccatgttcag 


agtcctgtgt 


ctccccagag gttactatag 


aaaccctaca 


gccagcaaca 


660 


gagctctcca 


aggcagcaga 


agtggaatca 


gtgaaagagc 


agctgccagc 


taaagcattg 


720 


gaaacgatgg 


cagagcagac 


cactgatgtg gtgcactctc 


catccacaga 


cacaacacca 


780 


ggcccagaca 


cagaggcagc 


actggctaaa 


gacatagaag 


agatcaccaa 


gccagatgtg 


840 


atattggcaa 


atgtcacgca 


gccatctact 


gaatcggata 


tgttcctggc 


ccaggacatg 


900 


gaactactca 


caggaacaga 


ggcagcccac gctaacaata 


tcatattgcc 


tacagaacca 


960 


gacgaatctt 


caaccaagga 


tgtagcacca 


cctatggaag 


aagaaattgt 


cccaggcaat 


1020 



gata 1024 



<210> 142 
<:211> 566 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Size exclusion 
target sequence 
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<400> 142 

Ala Asp Leu Ser Leu Val Asp Ala Leu Thr Glu Pro Pro Pro Glu He 
15 10 15 

Glu Gly Glu He Lys Arg Asp Phe Met Ala Ala Leu Glu Ala Glu Pro 
20 25 30 

Tyr Asp Asp He Val Gly Glu Thr Val Glu Lys Thr Glu Phe He Pro 
35 40 45 

Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys Lys 
50 55 60 

Pro Cys Leu Asp Thr Ser Gin Val Glu Gly He Pro Ser Ser Lys Pro 
65 70 75 ao 

Thr Leu Leu Ala Asn Gly Asp His Gly Met Glu Gly Asn Asn Thr Ala 
85 90 95 

Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp Tyr 
100 105 110 

Gin Ser Ser Gin Asn Trp Pro Glu Asp Ala Ser Phe Cys Phe Gin Pro 
1X5 120 125 

Gin Gin Val Leu Asp Thr Asp Gin Ala Glu Pro Phe Asn Glu His Arg 
130 135 140 

Asp Asp Gly Leu Ala Asp Leu Leu Phe Val Ser Ser Gly Pro Thr Asn 
145 150 155 160 

Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr Gly 
165 170 175 

Met Leu Pro Cys Asp Ser Phe Ala Ser Thr Ala Val Val Ser Gin Glu 
180 185 . 190 

Trp Ser Val Gly Ala Pro Asn Ser Pro Cys Ser Glu Ser Cys Val Ser 
195 200 205 

Pro Glu Val Thr He Glu Thr Leu Gin Pro Ala Thr Glu Leu Ser Lys 
210 215 220 

Ala Ala Glu Val Glu Ser Val Lys Glu Gin Leu Pro Ala Lys Ala Leu 
225 230 235 240 

Glu Thr Met Ala Glu Gin Thr Thr Asp Val Val His Ser Pro Ser Thr 
245 250 255 

Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp He 
260 265 270 

Glu Glu He Thr Lys Pro Asp Val He Leu Ala Asn Val Thr Gin Pro 
275 280 285 

Ser Thr Glu Ser Asp Met Phe Leu Ala Gin Asp Met Glu Leu Leu Thr 
290 295 300 
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Gly Thr GIu Ala Ala His Ala Asn Asn He lie Leu Pro Thr Glu Pro 
305 310 315 320 

Asp Glu Ser Ser Thr Lys Asp Val Ala Pro Pro Met Glu Glu Glu He 
325 330 335 



Val Pro Gly Asn Asp Thr Thr Ser 
340 

Pro He Lys Met Asp Leu Ala Pro 

355 360 

Glu Thr Glu Leu Ala Pro Ala Lys 
370 375 



Pro Lys Glu Thr Glu Thr Thr Leu 
345 350 

Pro Glu Asp Val Leu Leu Thr Lys 
365 

Gly Met Val Ser Leu Ser Glu He 
380 



Glu Glu Ala Leu Ala 
385 

Ala Gin Glu Thr Val 
405 

Val Val Leu Pro Ser 
420 

Leu Pro Leu Glu Ala 
435 

Leu Glu Thr Glu Met 
450 

Thr Asn Leu Gly Met 
465 

Val Thr Leu Gly Lys 
485 



Lys Asn Asp Val Arg 
390 

Val Ser Glu Thr Glu 
410 

Asp Pro He Thr Thr 
425 

Glu Arg Pro Leu Val 
440 

Thr Leu Gly Lys Glu 
455 . 

Ala Lys Asp Met Ser 
470 

Asp Val Val He Leu 
490 



Ser Ala Glu He Pro Val 
395 400 

Val Val Leu Ala Thr Glu 
415 

Leu Thr Lys Asp Val Thr 
430 

Thr Asp Met Thr Pro Ser 
445 

Thr Ala Pro Pro Thr Glu 
460 

Pro Leu Pro Glu Ser Glu 
475 480 

Pro gIu Thr Lys Val Ala 
495 



Glu Phe Asn Asn Val Thr Pro Leu 
500 

Lys Asp Met Ser Pro Ser Ala Glu 
515 520 

Ala Asp Leu His Ser Gly Thr Glu 
530 535 

Pro 2U.a Ser Asp Leu Ala Leu Pro 
545 550 

Pro He Lys Asp Lys Gly 
565 



ser Glu Glu Glu Val Thr Ser Val 
505 510 

Thr Glu Ala Pro Leu Ala Lys Asn. 
525 

Leu He Val Asp Asn Ser Met Ala 
540 

Leu Glu Thr Lys Val Ala Thr Val 
555 560 



<210> 143 
<211> 63 
<212> DMA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial 
membrane target sequence 

<400> 143 

atgtgggcaa tcgggattac tgttctggtt 
gtc 



Sequence: Vesicle 

atcttcatca tcatcatcat cgtgtgggtt 60 

63 



<210> 144 
<211> 21 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Vesicle 
membrane target sequence 

<400> 144 

Met Trp Ala He Gly He Thr Val Leu Val He Phe lie He He He 
15 10 15 

He Val Trp Val Val *' ' 

20 - 



<210> 145 
<211> 61 
<212> DMA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Vesicle 
membrane target sequence \ 

<400> 145 

atgtgggcga tagggatcag tgtcctggtg atcattgtca tcatcatcat cgtgtggtgt 60 
q 61 



<210> 146 
<211> 20 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Vesicle 
membrane target sequence 

<400> 146 

Met Trp Ala He Gly He Ser Val Leu Val He He Val He He He 
15 10 15 

He Val Trp Cys 
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20 



<210> 147 
<211> 39 
<212> DNA 

<213> Artificial Sequence " 
<220> 

<223> Description of Artificial Sequence: Nuclear Export 
target sequence 

<400> 147 

gacctgcaga agaagctgga ggagctggaa cttgacgag 39 



<210> 148 
<211> 13 
<212> PRT 

<213> Artificial Secfuence 
<220> 

<223> Description of Artificial Sequence: Nuclear Export 
target sequence 

<400> 148 

Asp Leu Gin Lys Lys Leu Glu'Glu Leu Glu Leu Asp Glu 
1 5 10 



<210> 149 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peroxisome 
target sequence 

<400> 149 
tctaaactg 



<210> 150 
<211> 3 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peroxisome 
target sequence 

<400> 150 
Ser Lys Leu 
1 
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<210> 151 

<211> 3378 

<212> DNA 

<213> Mus musculus 

<220> 

<221> CDS 

<222> (1) . . (3375) 

<400> 151 

atg gcc gac etc agt ctt gtg gat gcg ttg aca gaa cca cct cca gaa 48 

Met Ala Asp Leu Ser heu Val Asp Ala Leu Thr Glu Pro Pro Pro Glu 
15 10 15 

att gag gga gaa ata aag cga gac ttc atg get gcg ctg gag gca gag 96 
lie Glu Gly Glu lie Lys Arg Asp Phe Met Ala Ala Leu Glu Ala Glu 
20 25 30 

ccc tat gat gac ate gtg gga gaa act gtg gag aaa act gag ttt att 144 
Pro Tyr Asp Asp lie Val Gly Glu Thr Val Glu Lys Thr Glu Phe He 
35 40 45 

cct etc ctg gat ggt gat gag aaa ace ggg aac tea gag tec aaa aag 192 
Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys 
50 55 60 

aaa ccc tgc tta gac act age eag gtt gaa ggt ate cca tct tct aaa 240 
Lys Pro Cys Leu Asp Thr Ser Gin Val Glu Gly He Pro Ser Ser Lys 
65 70 75 80 

cca aca etc eta gcc aat ggt gat cat gga atg gag ggg aat aac act 288 
Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met Glu Gly Asn Asn Thr 
85 90 95 

gca ggg tct cca act gac ttc ctt gaa gag aga gtg gac tat ccg gat 336 
Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp 
100 105 110 

tat cag age age cag aac tgg cca gaa gat gca age ttt tgt ttc cag 384 
Tyr Gin Ser Ser Gin Asn Trp Pro Glu Asp Ala Ser Phe Cys Phe Gin 
115 120 125 

cct cag caa gtg tta gat act gac cag get gag ccc ttt aac gag cac 432 
Pro Gin Gin Val Leu Asp Thr Asp Gin Ala Glu Pro Phe Asn Glu His 
130 135 -140 

cgt gat gat ggt ttg gca gat ctg etc ttt gtc tec agt gga ccc acg 480 
Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val Ser Ser Gly Pro Thr 
145 150 155 160 

aac get tct gca ttt aca gag cga gac aat cct tea gaa gac agt tac 528 
Asn Ala Ser Ala Phe Thr Glu, Arg Asp Asn Pro Ser Glu Asp Ser Tyr 
165 170 175 

ggt atg ctt ccc tgt gac tea ttt get tec acg get gtt gta tct cag 576 
Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr Ala Val Val Ser Gin 
180 185 190 
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gag tgg tct gtg gga gcc cca aac tct cca tgt tea gag tec tgt gtc 624 
Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys Ser Glu Ser Cys Val 
1S5 200 205 

tec cca gag gtt act ata gaa acc eta cag cca gca aca gag etc tec 672 
Ser Pro Glu Val Thr lie Glu Thr Leu Gin Pro Ala Thr Glu Leu Ser 
210 215 220 

aag gca gca gaa gtg gaa tea gtg aaa gag cag ctg cca get aaa gca 720 
Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gin Leu Pro Ala Lys Ala 
225 230 235 240 

ttg gaa acg atg gca gag cag acc act gat gtg gtg cac tct cca tec 768 
Leu Glu Thr Met Ala Glu Gin Thr Thr Asp Val Val His Ser Pro Ser 
245 250 255 

aca gac aca aca cca ggc cca gac aca gag gca gca ctg get aaa gac 816 
Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp 
260 265 270 

ata gaa gag ate acc aag cca gat gtg ata ttg gca aat gtc acg cag 8 64 
lie Glu Glu lie Thr Lys Pro Asp Val He Leu Ala Asn Val Thr Gin 
275 280 285 

cca tct act gaa teg gat atg ttc ctg_gce cag gac atg gaa eta etc 912 
Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gin Asp Met Glu Leu Leu 
290 295 300 

aca gga aca gag gca gee cac get aac aat ate ata ttg cet aca gaa 960 
Thr Gly Thr Glu Ala Ala His Ala Asn Asn He He Leu Pro Thr Glu 
305 310 315 320 

cca gac gaa tct tea ace aag gat gta gca cca cet atg gaa gaa gaa 1008 
Pro Asp Glu Ser Ser Thr Lys Asp Val Ala Pro Pro Met Glu Glu Glu 
325 330 . 335 

att gtc cca ggc aat gat acg aca tee ccc aaa gaa aca gag aca aca 1056 
lie Val Pro Gly Asn Asp Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr 
340 345 350 

ett cca ata aaa atg gac ttg gca cca cet gag gat gtg tta ett acc 1104 
Leu Pro lie Lys Met Asp Leu Ala Pro Pro Glu Asp Val Leu Leu Thr 
355 360 365 

aaa gaa aca gaa eta gcc cca gcc aag ggc atg gtt tea etc tea gaa 1152 
Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met Val Ser Leu Ser Glu 
370 375 380 

ata gaa gag get ctg gca aag aat gat gtt cge tct gca gaa ata cet 12 00 
He Glu Glu Ala Leu Ala Lys Asn Asp Val Arg Ser Ala Glu He Pro 
385 390 395 400 

gtg get cag gag aca gtg gtc tea gaa aca gag gtg gtc ctg gca aca 1248 
Val Ala Gin Glu Thr Val Val Ser Glu Thr Glu Val Val Leu Ala Thr 
405 410 415 
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gaa gtg gta ctg ccc tea gat ccc ata aca aca ttg aca aag gat gtg 1296 
Glu Val Val Leu Pro Ser Asp Pro lie Thr Thr Leu Thr Lys Asp Val 
420 425 430 

aca etc ccc tta gaa gca gag aga ccg ttg gtg acg gac atg act cca 1344 
Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val Thr Asp Met Thr Pro 
435 440 445 

tct ctg gaa aca gaa atg acc eta ggc aaa gag aca get cca ccc aca 1392 
Ser Leu Glu Thr Glu Met Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr 
450 455 460 

gaa aca aat ttg ggc atg gcc aaa gac atg tct cca etc cca gaa tea 144 0 
Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser Pro Leu Pro Glu Ser 
465 470 475 480 

gaa gtg act ctg ggc aag gac gtg gtt ata ctt cca gaa aca aag gtg 148 8 
Glu Val Thr Leu Gly Lys Asp Val Val He Leu Pro Glu Thr Lys Val 
485 490 495 

get gag ttt aac aat gtg act cca ctt tea gaa gaa gag gta acc tea 153 6 
Ala Glu Phe Asn Asn Val Tlir Pro Leu Ser Glu Glu Glu Val Thr Ser 
500 505 510 

gtc aag gac atg tct ccg tct gca gaa aca gag get ccc ctg get aag 15 84 
Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys 
5X5 520 525 

aat get gat ctg cac tea gga aca gag ctg att gtg gac aac age atg 1632 
Asn Ala Asp Leu His Ser Gly Thr Glu Leu He Val Asp Asn Ser Met 
530 535 540 

get cca gcc tec gat ctt gca ctg ccc ttg gaa aca aaa gta gca aca 1680 
Ala Pro Ala Ser Asp Leu Ala, Leu Pro Leu Glu Thr Lys Val Ala Thr 
545 550 555 560 

gtt cca att aaa gac aaa gga act gta cag act gaa gaa aaa cca cgt 172 8 
Val Pro He Lys Asp Lys Gly Thr Val Gin Thr Glu Glu Lys Pro Arg 
565 570 575 

gaa gac tec cag tta gca tct atg cag cac aag gga cag tea aca gta 1776 
Glu Asp Ser Gin Leu Ala Ser Met Gin His Lys Gly Gin Ser Thr Val 
580 585 590 

cct cet tgc acg get tea cca gaa cca gtc aaa get gca gaa caa atg 1824 
Pro Pro Cys Thr Ala Ser Pro Glu Pro Val Lys Ala Ala Glu Gin Met 
595 600 605 

tct acc tta cca ata gat gca cct tct cca tta gag aac tta gag cag 1872 
Ser Thr Leu Pro He Asp Ala Pro Ser Pro Leu Glu Asn Leu Glu Gin 
610 615 620 

aag gaa acg cct ggc age cag cct tct gag cct tgc tea gga gta tec . 1920 
Lys Glu Thr Pro Gly Ser Gin Pro Ser Glu Pro Cys Ser Gly Val Ser 
625 630 ^ 635 640 

egg caa gaa gaa gca aag get get gta ggt gtg act gga aat gac ate 1968^ 
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Arg Gin Glu Glu Ala Lys Ala Ala Val Gly Val Thr Gly Asn Asp He 
645 650 655 

act acc ccg cca aac aag gag cca cca cca age cca gaa aag aaa gca 
Thr Thr Pro Pro Asn Lys Glu Pro Pro Pro Ser Pro Glu Lys Lys Ala 
660 665 670 

aag cct ttg gcc acc act caa cct gca aag act tea aca teg aaa gcc 
Lys Pro Leu Ala Thr Thr Gin Pro Ala Lys Thr Ser Thr Ser Lys Ala 
675 680 685 



ggt ggg ttg aat aaa aaa ccc atg age etc gcc tea ggc tea gtg cca 
Gly Gly Leu Asn Lys Lys Pro Met Ser Leu Ala Ser Gly Ser Val Pro 
705 710 715 720 

get gcc cca cac aaa cgc cct get get gcc act get act gcc agg cct 
Ala Ala Pro His Lys Arg Pro Ala Ala Ala Thr Ala Thr Ala Arg Pro 
725 730 735 



atg tct gca cct age cgc tct tct ggg get ctt tct gtg gac aag aag 
Met Ser Ala Pro Ser Arg Ser Ser Gly Ala Leu Ser Val Asp Lys Lys 



2016 



2064 



aaa aca cag ccc act tct etc cct aag caa cca get ccc acc acc tct 2112 
Lys Thr Gin Pro Thr Ser Leu Pro Lys Gin Pro Ala Pro Thr Thr Ser 
690 695 700 



2160 



2208 



tec acc eta cct gcc aga gac gtg aag cca aag cca att aca gaa get 2256 
Ser Thr Leu Pro Ala Arg Asp Val Lys Pro Lys Pro He Thr Glu Ala 
740 745 . 750 

aag gtt gcc gaa aag egg acc tct cca tec aag cct tea tct gcc cca 2304 
Lys Val Ala Glu Lys Arg Thr Ser Pro Ser Lys Pro Ser Ser Ala Pro 
755 760 765 

gcc etc aaa cct gga cct aaa acc acc cca acc gtt tea aaa gcc aca 
Ala Leu Lys Pro Gly Pro Lys Thr Thr Pro Thr Val Ser Lys Ala Thr 
770 . 775 y 780 

tct ccc tea act ctt gtt tec act gga cca ,agt agt aga agt cca get 
Ser Pro Ser Thr Leu Val Ser Thr Gly Pro Ser Ser Arg Ser Pro Ala 
785 790 795 800 

aca act ctg cct aag agg cca acc age ate aag act gag ggg aaa cct 2448 
Thr Thr Leu Pro Lys Arg Pro Thr Ser lie Lys Thr Glu Gly Lys Pro 
805 BIO 815 



2352 



2400 



2496 



2544 



get gat gtc aaa agg atg act get aag tct gcc tea get gac ttg agt 

Ala Asp Val Lys Arg Met Thr Ala Lys Ser Ala Ser Ala Asp Leu Ser 
820 825 830 

cgc tea aag acc acc tct gcc agt tct gtg aag aga aac acc act ccc 

Arg Ser Lys Thr Thr Ser Ala Ser Ser Val Lys Arg TVsn Thr Thr Pro 
835 840 845 

act ggg gca gca ccc cca gca ggg atg act tec act ega gtc aag ccc 2592 

Thr Gly Ala Ala Pro Pro Ala Gly Met Thr Ser Thr Arg Val Lys Pro 
850 855 860 



2640 
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865 870 875 880 

ccc act tec act aag cct age tec tct get ccc agg gtg age cge ctg 2688 
Pro Thr Ser Thr Lys Pro Ser Ser Ser Ala Pro Arg Val Ser Arg Leu 
685 , 890 895 

gee aca act gtt tct gcc cct gac ctg aag agt gtt cgc tec aag gtc 2736 
Ala Thr Thr Val Ser Ala Pro Asp Leu Lys Ser Val Arg Ser Lys Val 
900 905 910 

ggc tct aca gaa aac ate aaa cac eag cct gga gga ggc egg gee aaa 2784 
Gly Ser Thr Glu Asn He Lys His Gin Pro Gly Gly Gly Arg Ala Lys 
915 920 925 

gta gag aaa aaa aca gag gca get ace aca get ggg aag cct gaa cct 2832 
Val Glu Lys Lys Thr Glu Ala Ala Thr Thr Ala Gly Lys Pro Glu Pro 
930 935 940 

aat gca gtc act aaa gca gee ggc tee att gcg agt gca eag aaa ecg 2880 
Aan Ala Val Thr Lys Ala Ala Gly Ser lie Ala Ser Ala Gin Lys Pro 
945 950 955 960 

cct get ggg aaa gtc eag at a gta tee aaa aaa gtg age tac agt cat 2928 
Pro Ala Gly Lys Val Gin He Val Ser Lys Lys Val Ser Tyr Ser His 
965 970 . 975 

att caa tee aag tgt gtt tec aag gac aat att aag cat gtc cct gga 2976 
He Gin Ser Lys Cys Val Ser Lys Asp Asn He Lys His Val Pro Gly 
980 985 990 

tgt ggc aat gtt cag att eag aac aag aaa gtg gac ata tec aag gtc 3024 
Cys Gly Asn Val Gin He Gin Asn Lys Lys Val Asp lie Ser Lys Val 
995 1000 1005 

tec tee aag tgt ggg tee aaa get aat ate aag cac aag cct ggt gga 3072 
Ser Ser Lys Cys Gly Ser Lys Ala Asn He .Lys His Lys Pro Gly Gly 
1010 1015 1020 

gga gat gtc aag att gaa agt cag aag ttg aac ttc aag gag aag gcc 3120 
Gly Asp Val Lys He Glu Ser Gin Lys Leu Asn Phe Lys Glu Lys Ala 
1025 1030 1035 1040 

caa gee aaa gtg gga tec ett gat aac gtt ggc cac ttt cct gca gga 3168 
Gin Ala Lys Val Gly Ser Leu Asp Asn Val Gly His Phe Pro Ala Gly 
1045 1050 1055 

ggt gee gtg aag act gag ggc ggt ggc agt gag gcc ctt ecg tgt cca 3216 
Gly Ala Val Lys Thr Glu Gly Gly Gly Ser Glu Ala Leu Pro Cys Pro 
1060 1065 1070 

ggc ccc ccc get ggg gag gag cca gtc ate cct gag get gcg cct gac 3264 
Gly Pro Pro Ala Gly Glu Glu Pro Val He Pro Glu Ala Ala Pro Asp 
1075 1080 1085 

cgt ggc gcc cct act tea gcc agt ggc etc agt ggc cac ace ace ctg 3312 
Arg Gly Ala Pro Thr Ser Ala Ser Gly Leu Ser Gly His Thr Thr Leu 
1090 1095 1100 
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tea ggg ggt ggt gac caa agg gag ccc cag acc ttg gac age cag ate 3360 
Ser Gly Gly Gly Asp Gin Arg Glu Pro Gin Thr Leu Asp Ser Gin lie 
1105 1110 1115 1120 



cag gag aca age ate taa 
Gin Glu Thr Ser lie 
1125 



<210> 152 

<211> 1125 

<212> PRT 

<213> MUB mus cuius 



<400> 152 

Met Ala Asp Leu Ser Leu Val Asp Ala Leu Thr Glu Pro Pro Pro Glu 
1 5 10 15 

He Glu Gly Glu He Lys Arg Asp Phe Met Ala Ala Leu Glu Ala Glu 
20 25 30 

Pro Tyr Asp Asp He Val Gly Glu Thr Val Glu Lys Thr Glu Phe He 
35 40 45 

Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys 
50 ' 55 60 ■ 

Lys Pro Cys Leu Asp Thr Ser Gin Val Glu Gly He Pro Ser Ser Lys 
65 70 75 BO 

Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met Glu Gly Asn Asn Thr 
85 90 95 

Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp 
100 105 110 

Tyr Gin Ser Ser Gin Asn Trp Pro Glu Asp Ala Ser Phe Cys Phe Gin 
115 120 125 

Pro Gin Gin Val Leu Asp Thr Asp Gin Ala Glu Pro Phe Asn Glu His 
130 135 140 

Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val Ser Ser Gly Pro Thr 
145 150 155 ISO 

Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr 
165 170 175 

Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr Ala Val Val Ser Gin 
180 185 190 

Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys Ser Glu. Ser Cys Val 
X95 200 205 

Ser Pro Glu Val Thr He Glu Thr Leu Gin Pro Ala Thr Glu Leu Ser 
210 215 220 
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Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gin Leu Pro Ala Lys Ala 
225 230 235 240 

Leu Glu Thr Met Ala Glu Gin Thr Thr Asp Val Val His Ser Pro Ser 
245 250 255 

Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp 
260 265 270 

He Glu Glu He Thr Lys Pro Asp Val He Leu Ala Asn Val Thr Gin 
275 280 285 

Pro ser Thr Glu Ser Asp Met Phe Leu Ala Gin Asp Met Glu Leu Leu 
290 295 300 

Thr Gly Thr Glu Ala Ala His Ala Asn Asn He He Leu Pro Thr Glu 
305 310 315 320 

Pro Asp Glu Ser Ser Thr Lys Asp Val Ala Pro Pro Met Glu Glu Glu 
325 330 335 

He Val Pro Gly Asn Asp Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr 
340 345 350 

Leu Pro He Lys Met Asp Leu Ala Pro Pro Glu Asp Val Leu- Leu Thr 
355 360 365 

Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met Val Ser Leu Ser Glu 
370 375 380 

He Glu Glu Ala Leu Ala Lys Asn Asp Val Arg Ser Ala Glu He Pro 
385 390 395 400 

Val Ala Gin Glu Thr Val Val Ser Glu Thr Glu Val Val Leu Ala Thr 
405 410 . 415 

Glu Val Val Leu Pro Ser Asp Pro He Thr Thr Leu Thr Lys Asp Val 
420 425 430 

Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val Thr Asp Met Thr Pro 
435 440 445 

Ser Leu Glu Thr Glu Met Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr 
450 455 460 

Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser Pro Leu Pro Glu Ser 
465 470 475 480 

Glu Val Thr Leu Gly Lys Asp Val Val He Leu Pro Glu Thr Lys Val 
485 490 495 

Ala Glu Phe Asn Asn Val Thr Pro Leu Ser Glu Glu Glu Val Thr Ser 
50O S05 510 

Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys 
515 520 525 
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Asn Ala Asp Leu -His Ser Gly Thr Glu Leu lie Val Asp Asn Ser Met 
530 535 540 

Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu Thr Lys Val Ala Thr 
545 550 555 560 

Val Pro lie Lys Asp Lys Gly Thr Val Gin Thr Glu Glu Lys Pro Arg 
565 570 575 

Glu Asp Ser Gin Leu Ala Ser Met Gin His Lys Gly Gin Ser Thr Val 
580 585 590 

Pro Pro Cys Thr Ala Ser Pro Glu Pro Val Lys Ala Ala Glu Gin Met 
595 600 605 

Ser Thr Leu Pro He Asp Ala Pro Ser Pro Leu Glu Asn Leu Glu Gin 
610 615 620 

Lys Glu Thr Pro Gly' Ser Gin Pro Ser Glu Pro Cys Ser Gly Val Ser 
625 630 635 640 

Arg Gin Glu Glu Ala Lys Ala Ala Val Gly Val Thr Gly Asn Asp He 
645 650 655 

Thr Thr Pro Pro Asn Lys Glu Pro Pro Pro Ser Pro Glu Lys Lys Ala 
660 665 . 670 

Lys Pro Leu Ala Thr Thr Gin Pro Ala Lys Thr Ser Thr Ser Lys Ala 
675 680 665 

Lys Thr Gin Pro Thr Ser Leu Pro Lys Gin Pro Ala Pro Thr Thr Ser 
690 695 700 

Gly Gly Leu Asn Lys Lys Pro Met Ser Leu Ala Ser Gly Ser Val Pro 
705 710 .715 720 

Ala Ala Pro His Lys Arg Pro Ala Ala Ala Thr Ala Thr Ala Arg Pro 
725 730 735 

Ser Thr Leu Pro Ala Arg Asp Val Lys Pro Lys Pro He Thr Glu Ala 
740 745 750 

Lys Val Ala Glu Lys Arg Thr Ser Pro Ser Lys Pro Ser Ser Ala Pro 
755 760 765 

Ala Leu Lys Pro Gly Pro Lys Thr Thr Pro Thr Val Ser Lys Ala Thr 
770 775 780 

Ser Pro Ser Thr Leu Val Ser Thr Gly Pro Ser Ser Arg Ser Pro Ala 
785 790 795 800 

Thr Thr Leu Pro Lys Arg Pro Thr Ser He Lys Thr Glu Gly Lys Pro 
805 810 815 

Ala Asp Val Lys Arg Met Thr Ala Lys Ser Ala Ser Ala Asp Leu Ser 
820 825 830 
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Arg Ser Lys Thr Thr Ser Ala Ser Ser Val Lys Arg Aen Thr Thr Pro 
835 840 845 

Thr Gly Ala Ala Pro Pro Ala Gly Met Thr Ser Thr Arg Val Lys Pro 
850 855 860 

Met Ser Ala Pro Ser Arg Ser Ser Gly Ala Leu Ser Val Asp Lys Lys 
865 870 875 880 

Pro Thr Ser Thr Lys Pro Ser Ser Ser Ala Pro Arg Val Ser Arg Leu 
885 890 895 

Ala Thr Thr Val Ser Ala Pro Asp Leu Lys Ser Val T^g Ser Lys Val 
900 905 910 

Gly Ser Thr Glu Asn lie Lys His Gin Pro Gly Gly Gly Arg Ala Lys 
915 920 925 

Val Glu Lys Lys Thr Glu Ala Ala Thr Thr Ala Gly Lys Pro Glu Pro 
930 935 940 

Asn Ala Val Thr Lys Ala Ala Gly Ser lie Ala Ser Ala Gin Lys Pro 
945 950 955 960 

Pro Ala Gly~ Lys Val Gin lie Val Ser Lys Lys Val Ser Tyr Ser His 
965 970 975 

lie Gin Ser Lys Cys Val Ser Lys Asp Asn lie Lys His Val Pro Gly 
980 985 990 

Cys Gly Asn Val Gin He Gin Asn Lys Lys Val Asp He Ser Lys Val 
995 1000 1005 

Ser Ser Lys Cys Gly Ser Lys Ala Asn He Lys His Lys Pro Gly Gly 
1010 1015 1020 

Gly Asp Val Lys He Glu Ser Gin Lys Leu Asn Phe Lys Glu. Lys Ala 
1025 . 1030 1035 1040 

Gin Ala Lys Val Gly Ser Leu Asp Asn Val Gly His Phe Pro Ala Gly 
1045 1050 1055 

Gly Ala Val Lys Thr Glu Gly Gly Gly Ser Glu Ala Leu Pro Cys Pro 
1060 1065 1070 

Gly Pro Pro Ala Gly Glu Glu Pro Val He Pro Glu Ala Ala Pro Asp 
1075 1080 1085 

Arg Gly Ala Pro Thr Ser Ala Ser Gly Leu Ser Gly His Thr Thr Leu 
1090 1095 1100 

Ser Gly Gly Gly -Asp Gin Arg Glu Pro Gin Thr Leu Asp Ser Gin He 
1105 1110 1115 1120 

Gin Glu Thr Ser He 
1125 
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<210> 153 
<211> 96 
<212> DNA 

<213> Artificial Secjuence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 153 

tcatcatccg gagctggagc cggagctggc cgatcggctg ttaaatctga aggaaagaga 60 
aagtgtgacg aagttgatgg aattgatgaa gtagca 96 



<210> 154 
<211> 99 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 154 

gaagaaggat ccggcacttg ggggtgtaga atgaacaccc tccaagctga gcttgcacag 60 
gatttcgtgg acagtagaca tagtacttgc tacttcatc 99 



<210> 155 

<211> IB 

<212> DNA 

<213> Artificial Sequence ; 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 155 

tcatcatccg gagctgga 18 



<210> 156 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 156 

gaagaaggat ccggcact 18 
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<210> 157 
<211> 96 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 157 

tcatcatccg gaagaaggaa acgacaaaag cgatcggctg ttaaatctga aggaaagaga 60 
aagtgtgacg aagttgatgg aattgatgaa gtagca 96 



<210> 158 
<211> 18 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: 
ol igonuc 1 eo t ide 

" <400> 158 
tcatcatccg gaagaagg 



<210> 159 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 159 

tcatcatccg gaagaaggaa acgacaaaag cgatcgacaa gacttgttga aattgacaac 60 



<210> 160 
<211> 99 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 160 

gaagaaggat ccggcacttg ggggtgtaga atgaacaccc tccaagctga gcttgcacag 60 
gatttcgtgg acagtagaca tagtactgtt gtcaatttc 99 
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<210> 161 
<211> 84 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 161 

tcatcatccg gaagaaggaa acgacaaaag cgatcgtatc aaaaaggaat accagttgaa 60 
acagacagcg aagagcaacc ttat 84 



<210> 162 
<211> 99 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 162 

gaagaaggat ccggcacttg ggggtgtaga atgaacaccc tccaagctga gcttgcacag 60 
gatttcgtgg acagtagaca tagtactata aggttgctc 99 



<210> 163 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Secjuence: 
oligonucleotide 

<400> 163 

tcatcatccg gaagaaaacg tatacgtact tacctcaagt cctgcaggcg gatgaaaaga 60 



<210> 164 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 164 

gaagaacgat cgagtaaggt gggaaggaat aggtcgagac atctcaaaac cacttctttt 60 
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<210> 165 
<211> 18 
<212> DHA 

<:213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 165 

tcatcatccg gaagaaaa 18 



<210> 166 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 166 

gaagaacgat cgagtaag 18 



<210> 167 
<211> 14 
<212> DNA 

<213> Artificial Secjuence 
<220> 

<223> Description of Artificial Sequence: Caspase-1, 4 , 5 
siibstrate recognition sequence 

<400> 167 

ttagaacatg acaa 14 



<210> 168 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secjuence: Caspase-1, 4, 5 
substrate recognition sequence 

<400> 168 
Leu Glu His Asp 
1 

<210> 169 

<211> 1380 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: GFP-HSP27 

<220> 

<:221> CDS 

<222> (1) . . (1380) 



48 



<400> 169 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

1 5 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gee acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg acc tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gee atg ccc gaa ggc tac gtc cag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val C31n Glu 
85 90 95 

cgc ace ate tote ttc aag gac gac ggc aac tac aag acc cgc gee gag 33 6 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 . 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 



aac tac aac age cac aac gtc tat ate atg gee gac aag cag aag aac 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 ISO 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gee gac cac tac cag cag aac ace ccc ate ggc gac ggc 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 
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ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gcc ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

^ age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag tec 720 
Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

gga etc aga tct cga gcg geg tec aga gca gag tea gcc age atg acc 768 
Gly Leu Arg Ser Arg Ala Ala Ser Arg Ala Glu Ser Ala Ser Met Thr 
245 250 255 

gag cgc cgc gtc ccc ttc teg etc ctg egg ggc ccc age tgg gac ccc 816 
Glu Arg Arg Val Pro Phe Ser Leu Leu Arg Gly Pro Ser Trp Asp Pro 
260 265 270 

ttc cgc gae tgg tac ccg cat age cgc etc ttc gac eag gcc ttc ggg 864 
Phe Arg Asp Trp Tyr Pro His Ser Arg Leu Phe Asp Gin Ala Phe Gly 
275 280 285 

ctg ccc egg ctg ccg gag gag t^g" teg cag tgg tta ggc ggc age age 912 
Leu Pro Arg Leu Pro Glu Glu Trp Ser Gin Trp Leu Gly Gly Ser Ser 
290 295 300 

tgg cea ggc tac gtg cgc ccc ctg cec ccc gcc gcc ate gag age ece 960 
Trp Pro Gly Tyr Val Arg Pro Leu Pro Pro Ala Ala lie Glu Ser Pro 
305 310 315 320 

gca gtg gee gcg cec gcc tac age cgc gcg etc age egg caa etc age 1008 
Ala Val Ala Ala-^ Pro Ala Tyr Ser Arg Ala Leu Ser Arg Gin Leu Ser 
325 330 335 

age ggg gtc teg. gag ate egg cac act geg gac cgc tgg cgc gtg tee 1056 
Ser Gly Val Ser Glu lie Arg His Thr Ala Asp Arg Trp Arg Val Ser 
340 . . 345 350 

ctg gat gtc aac cac tte gcc ccg gac gag ctg acg gtc aag acc aag 1104 
Leu Asp Val Asn His Phe Ala Pro. Asp Glu Leu Thr Val Lys Thr Lys 
355 360 365 

gat ggc gtg gtg gag ate ace ggc aag cac gag gag egg cag gac gag 1152 
Asp Gly Val Val Glu lie Thr Gly Lys His Glu Glu Arg Gin Asp Glu 
370 375 380 

eat ggc tac ate tec egg tge tte acg egg aaa tac acg ctg ccc ccc 1200 
His Gly Tyr He Ser Arg Cys' Phe Thr Arg Lys Tyr Thr Leu Pro Pro 
385 390 395 400 

ggt gtg gac ccc acc caa gtt tee tec tec ctg tec cet gag ggc aca 1248 
Gly Val Asp Pro Thr Gin Val Ser Ser Ser Leu Ser Pro Glu Gly Thr 
405 410 415 

ctg acc gtg gag gcc cec atg ccc aag eta gcc acg eag tec aac gag 1296 
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lisu Thr Val Glu Ala Pro Met Pro Lys Leu Ala Thr Gin Ser Asn Glu 
420 425 430 

ate acc ate oca gtc acc ttc gag teg egg gcc cag ctt ggg ggc cca 1344 

He Thr He Pro Val Thr Phe Glu Ser Arg Ala Gin Leu Gly Gly Pro 
435 440 445 

gaa get gca aaa tec gat gag act gcc gcc aag taa 1380 

Glu Ala Ala Lys Ser Asp Glu Thr Ala Ala Lys 

450 455 460 



<210> 170 
<211> 459 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-HSP27 
<400> 170 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Tyr Gly Val Gin <:ys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
1X5 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 
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. Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220^ - 

Val Thr Ala Ala Gly lie Thx Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Ala Ser Arg Ala Glu Ser Ala Ser Met Thr 
245 250 255 

Glu Arg Arg Val Pro Phe Ser Leu Leu Arg Gly Pro Ser Trp Asp Pro 
260 265 270 

Phe Arg Asp Trp Tyr Pro His Ser Arg Leu Phe Asp Gin Ala' Phe Gly 
275 280 285 

Leu Pro Arg Leu Pro Glu Glu Trp Ser Gin Trp Leu Gly Gly Ser Ser 
290 295 300 

Trp Pro Gly Tyr Val Arg Pro Leu Pro Pro Ala Ala He Glu Ser Pro 
305 310 315 320 

Ala Val Ala Ala Pro Ala Tyr Ser Arg Ala Leu Ser Arg Gin Leu Ser 
325 330 335 

Ser Gly Val Ser Glu lie Arg His Thr Ala Asp Arg Trp Arg Val Ser 
340 345 ' 350 

Leu Asp Val Asn His Phe Ala Pro Asp Glu Leu Thr Val Lys Thr Lys 
355 360 365 

Asp Gly Val Val Glu He Tlir Gly Lys His Glu Glu Arg Gin Asp Glu 
370 375 380 

His Gly Tyr He Ser Arg Cys Phe Thr Arg Lys Tyr Thr Leu Pro Pro 
385 390 395 400 

Gly Val Asp Pro Thr Gin Val Ser Ser Ser Leu Ser Pro Glu Gly Thr 
405 410 415 

Leu Thr Val Glu Ala Pro Met Pro Lys Leu Ala Thr Gin Ser Asn Glu 
420 425 430 

He Thr He Pro Val Thr Phe Glu Ser Arg Ala Gin Leu Gly Gly Pro 
435 440 445 

Glu Ala Ala Lys Ser Asp Glu Thr Ala Ala Lys 
450 455 



<210> 171 
<211> 2823 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: GFP-HSP70 

<220> 

<221> CDS 

<222> (1) , . (2823) 

<400> 171 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Tlir Gly Val Val Pro lie Leu 
15 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gas 9gc gag ggc gat gee acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg acc tac ggc gtg cag tgc ttc age cge tac ccc gac cac atg aag 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gee atg ccc gaa ggc tac gtc cag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cge ace ate ttc ttc aag gac gac ggc aac tac aag acc cge gee gag 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu / 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cge ate gag ctg aag ggc 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr- 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gee gac aag cag aag aac 4 80 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cge cac aac ate gag gac ggc age 52 8 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gee gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tee gee ctg 624 
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Pro Val Leu Leu Pro Asp Asn His Tyx Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gcc ggg ate act etc ggc atg gac gag ctg tac aag tec 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

gga atg teg gtg gtg ggc ata gac ctg ggc ttc eag age tgc tac gtc 768 
Gly Met Ser Val Val Gly He Asp Leu Gly Phe Gin Ser Cys Tyr Val 
245 250 255 

get gtg gcc cgc gcc ggc ggc ate gag act ate get aat gag tat age 816 
Ala Val Ala Arg Ala Gly Gly He Glu Thr He Ala Asn Glu Tyr Ser 
260 265 270 

gac cgc tgc acg ccg get tgc att tet ttt ggt cet aag aat cgt tea 864 
Asp Arg Cys Thr Pro Ala Cys He Ser Phe Gly Pro Lys Asn Arg Ser 
275 280 285 

att gga gca gca get aaa age eag gta att tct aat gea aag aac aca 912 
He Gly Ala Ala Ala Lys Ser Gin Val He Ser Asn Ala Lys Asn Thr 
290 295 300 

gtc caa gga ttt aaa aga ttc cat ggc cga gca ttc tct gat eea ttt 960 
Val Gin Gly Phe Lys Arg Phe His Gly Arg Ala Phe Ser Asp Pro Phe 
305 310 315 320 

gtg gag gca gaa aaa tct aac ctt gca tat gat att gtg eag tgg cet 1008 
Val Glu Ala Glu Lys Ser Asn Leu Ala Tyr Asp He Val Gin Trp Pro 
325 -330 335 

aca gga tta aca ggt ata aag gtg aca tat .atg gag gaa gag ega aat 1056 
Thr Gly Leu Thr Gly He Lys Val Thr Tyr Met Glu Glu Glu Arg Asn 
340 345 350 

ttt acc act gag caa gtg act gcc atg ctt ttg tee aaa ctg aag gag 1104 
Phe Thr Thr Glu Gin Val Thr Ala Met Leu Leu Ser Lys Leu Lys Glu 
355 360 365 . 

aca gcc gaa agt gtt ctt aag aag cet gta gtt gac tgt gtt gtt teg 1152 
Thr Ala Glu Ser Val Leu Lys Lys Pro Val Val Asp Cys Val Val Ser 
370 375 380 

gtt cet tgt ttc tat act gat gea gaa aga cga tea gtg atg gat gca 1200 
Val Pro Cys Phe Tyr Thr Asp Ala Glu Arg Arg Ser Val Met Asp Ala 
385 390 395 400 

aca eag att get ggt ett aat tgc ttg cga tta atg aat gaa acc act 124 8 
Thr Gin He Ala Gly Leu Asn Cys Leu Arg Leu Met Asn Glu Thr Thr 
405 410 415 

gca gtt get ctt gca tat gga ate tat aag cag gat ctt cet cgc tta 1296 
Ala Val Ala Leu Ala Tyr Gly He Tyr Lys Gin Asp Leu Pro Arg Leu 
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420 425 430 

gaa gag aaa cca aga aat gta gtt ttt gta gac atg ggc cac tct get 1344 

Glu Glu Lys Pro Arg Asn Val Val Phe Val Asp Met Gly His Ser Ala 
435 440 445 

tat caa gtt tct gta tgt gca ttt aat aga gga aaa ctg aaa gtt ctg 1392 

Tyr Gin Val Ser Val Cys Ala Phe Asn Arg Gly Lys Leu Lys Val Leu 
450 455 450 

gcc act gca ttt gac acg aca ttg gga ggt aga aaa ttt gat gaa gtg 1440 

Ala Thr Ala Phe Asp Thr Thr Leu Gly Gly Arg Lys Phe Asp Glu Val 

465 470 475 480 

tta gta aat cac ttc tgt gaa gaa ttt ggg aag aaa tac aag eta gac 1488 

Leu Val Asn His Phe Cys Glu Glu Phe Gly Lys Lys Tyr Lys Leu Asp 
485 490 495 

att aag tec aaa ate cgt gca tta tta cga etc tct cag gag tgt gag 1536 

lie Lys Ser Lys lie Arg Ala Leu Leu Arg Leu Ser Gin Glu Cys Glu 

i-rtt- ein 



500 505 510 

aaa etc aag aaa ttg atg agt gca aat get tea gat etc cet ttg age 

Lys Leu Lys Lys Leu Met Ser Ala Asn Ala Ser Asp Leu Pro Leu Ser 

515 520 525 

att gaa tgt ttt atg aat gat gtt gat gta tct gga act atg aat aga 

He Glu Cys Phe Met Asn Asp Val Asp Val Ser Gly Thr Met Asn Arg 

530 535 540 

ggc aaa ttt ctg gag atg tgc aat gat etc tta get aga gtg gag cea 

Gly Lys Phe Leu Glu Met Cys Asn Asp Leu Leu Ala Arg Val Glu Pro 

545 550 555 560 

cca ett cgt agt gtt ttg gaa caa ace aag tta aag aaa gaa gat att 

Pro Leu Arg Ser Val Leu Glu Gin Thr Lys ieu Lys Lys Glu Asp He 

565 570 575 

tat gca gtg gag ata gtt ggt ggt get aca cga ate cet gcg gta aaa 

Tyr Ala Val Glu He Val Gly Gly Ala Thr Arg He Pro Ala Val Lys 

580 585 590 



get gat gaa get gtc act cga ggc tgt gca ttg cag tgt gcc ate tta 
Ala Asp Glu Ala Val Thr Arg Gly Cys Ala Leu Gin Cys Ala He Leu 
610 615 620 

teg cet get ttc aaa gtc aga gaa ttt tct ate act gat gta gta cca 
Ser Pro Ala Phe Lys Val Arg Glu Phe Ser He Thr Asp Val Val Pro 
625 630 635 S40 



1584 



1632 



1680 



1728 



1776 



gag aag ate age aaa ttt ttc ggt aaa gaa ett agt aca aca tta aat 1824 
Glu Lys He Ser Lys Phe Phe Gly Lys Glu Leu Ser Thr Thr Leu Asn 
595 600 605 



1872 



1920 



tat cea ata tct ctg aga tgg aat tct cea get gaa gaa ggg tea agt 1968 
Tyr Pro He Ser Leu Arg Trp Asn Ser Pro Ala Glu Glu Gly Ser Ser 
645 650 655 
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gac tgt gaa gtc ttt tec aaa aat cat get get cct ttc tct aaa gtt 2016 
Asp Cys Glu Val Phe Ser Lys Asn His Ala Ala Pro Phe Ser Lys Val 
660 665 670 

ctt aca ttt tat aga aag gaa cct ttc act ctt gag gcc tac tac age 2 064 
Leu Thr Phe Tyr Arg Lys Glu Pro Phe Thr Leu Glu Ala Tyr Tyr Ser 
675 680 685 

tct cct cag gat ttg ccc tat cca gat cct get ata get cag ttt tea 2112 
Ser Pro Gin Asp Leu Pro Tyr Pro Asp Pro Ala He Ala Gin Phe Ser 
690 \ 695 700 

gtt cag aaa gtc act cct cag tct gat ggc tec agt tea aaa gtg aaa 2160 

Val Gin Lys Val Thr Pro Gin Ser Asp Gly Ser Ser Ser Lys Val Lys 
705 710 715 720 

gtc aaa gtt cga gta aat gtc cat ggc att ttc agt gtg tec agt gca 22 08 

Val Lys Val Arg Val Asn Val His Gly He Phe Ser Val Ser Ser Ala 

725 730 735 

tct tta gtg gag gtt cac aag tct gag gaa aat gag gag cca atg gaa 2256 

Ser Leu Val Glu Val His Lys Ser Glu Glu Asn Glu Glu Pro Met . Glu 
740 745 .750 

aca gat cag aat gca aag gag gaa gag aag atg caa gtg gac cag gag 23 04 

Thr Asp Gin Asn Ala Lys Glu Glu Glu Lys Met Gin Val Asp Gin Glu 
755 760 765 

gaa cca cat gtt gaa gag caa cag cag cag aca cca gca gaa aat aag 23 52 

Glu Pro His Val Glu Glu Gin Gin Gin Gin Thr Pro Ala Glu Asn Lys 
770 775 780 

gca ga^ tct gaa gaa atg gag acc tct caa get §ga tec aag gat aaa 24 00 

Ala Glu Ser Glu Glu Met Glu Thr Ser Gin Ala Gly Ser Lys Asp Lys 
785 790 .795 800 

aag atg gac caa cca ccc caa tgc caa gaa ggc aaa agt gaa gac cag 2448 

Lys Met Asp Gin Pro Pro Gin Cys Gin Glu Gly Lys Ser Glu Asp Gin 

805 810 815 

tac tgt gga cct gcc aat cga gaa tea get ata tgg cag ata gac aga 2496 

Tyr Cys Gly Pro Ala Asn Arg Glu Ser Ala He Trp Gin He Asp Arg 
820 825 830 

gag atg etc aac ttg tac att gaa aat gag ggt aag atg ate atg cag 2544 

Glu Met Leu Asn Leu Tyr He Glu Asn Glu Gly Lys Met He Met Gin 
835 840 845 

gat aaa ctg gag aag gag egg aat gat get aag aac gca gtg gag gaa 2592 

Asp Lys Leu Glu Lys Glu Arg Asn Asp Ala Lys Asn Ala Val Glu Glu 
850 855 860 

tat gtg tat gaa atg aga gac aag ctt agt ggt gaa tat gag aag ttt 2640 

Tyr Val Tyr Glu Met Arg Asp Lys Leu Ser Gly Glu Tyr Glu Lys Phe 
865 870 875 880 
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gtg agt gaa gat gat cgt aac agt ttt act ttg aaa ctg gaa gat act 268 B 

Val Ser Glu Asp Asp Arg Aen Ser Phe Thr Leu Lys Leu Glu Asp Thr 
BBS 890 895 

gaa aat tgg ttg tat gag gat gga gaa gac cag cca aag caa gtt tat 2736 

Glu Asn Trp Leu Tyr Glu Asp Gly Glu Asp Gin Pro Lys Gin Val Tyr 

900 905 910 

gtt gat aag ttg get gaa tta aaa aat eta ggt caa cot att aag ata 2784 

Val Asp Lys Leu Ala Glu Leu Lys Asn Leu Gly Gin Pro lie Lys lie 
915 920 925 



cgt ttc cag gaa tct gaa gaa cga cca aat tat ttg aag 
Arg Phe Gin Glu Ser Glu Glu Arg Pro Asn Tyr Leu Lys 
930 935 940 



<210> 172 
<211> 941 
<212> PRT 

<213> Artificial Sequence 
c;220> 

<:223> Description of Artificial Sequence: GFP-HSP70 
<400> 172 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
3S 40 45/ 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp .Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 BO 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 ISO 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
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165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 -220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Met Ser Val Val Gly He Asp Leu Gly Phe Gin Ser Cys Tyr Val 
245 250 255 

Ala Val Ala Arg Ala Gly Gly He Glu Thr He Ala Asn Glu Tyr Ser 
260 265 270 

Asp Arg Cys Thr Pro Ala Cys He Ser Phe Gly Pro Lys Asn Arg Ser 
275 280 285 

He Gly Ala Ala Ala Lys Ser Gin Val lie Ser Asn Ala Lys Asn Thr 
290 295 300 

Val Gin Gly Phe Lys Arg Phe His Gly Arg Ala Phe Ser Asp Pro Phe 
305 310 315 320 

Val Glu Ala Glu Lys Ser Asn Leu Ala Tyr Asp He Val Gin Trp Pro 
325 330 335 

Thr Gly Leu Thr Gly He Lys Val Thr Tyr Met Glu Glu Glu Arg Asn 
340 345 3S0 

Phe Thr Thr Glu Gin Val Thr Ala Met Leu Leu Ser Lys Leu Lys Glu 
355 360 365 

Thr Ala Glu Ser Val Leu Lys Lys Pro Val Val Asp Cys Val Val Ser 
370 375 380 

Val Pro Cys Phe Tyr Thr Asp Ala Glu Arg Arg Ser Val Met Asp Ala 
385 390 395 400 

Thr Gin He Ala Gly Leu Asn Cys Leu Arg Leu Met Asn Glu Thr Thr 
405 410 415 

Ala Val Ala Leu Ala Tyr Gly He Tyr Lys Gin Asp Leu Pro Arg Leu 
420 425 430 

Glu Glu Lys Pro Arg Asn Val Val Phe Val Asp Met Gly His Ser Ala 
435 440 445 

Tyr Gin Val Ser Val Cys Ala Phe Asn Arg Gly Lys Leu Lys Val Leu 
450 4.55 460 

Ala Thr Ala Phe Asp Thr Thr Leu Gly Gly Arg Lys Phe Asp Glu Val 
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465 470 475 4B0 

Leu Val Asn His Phe Cys Glu Glu Phe Gly Lys Lys Tyr Lys Leu Asp 
485 490 495 • 

He Lys Ser Lys He Arg Ala Leu Leu Arg Leu Ser Gin Glu Cys Glu 
500 505 510 

Lys Leu Lys Lys Leu Met Ser Ala Asn Ala Ser Asp Leu Pro Leu Ser 
515 520 525 

He Glu Cys Phe Met Asn Asp Val Asp Val Ser Gly Thr Met Asn Arg 
530 535 540 

Gly Lys Phe Leu Glu Met Cys Asn Asp Leu Leu Ala Arg Val Glu Pro 
545 550 555 560 

Pro Leu Arg Ser Val Leu Glu Gin Thr Lys Leu Lys Lys Glu Asp He 
565 570 575 

Tyr Ala Val Glu He Val Gly Gly Ala Thr Arg He Pro Ala Val Lys 
580 585 590 

Glu Lys He Ser Lys Phe Phe Gly Lys Glu Leu Ser Thr Thr Leu Asn 
595 600 605 

Ala Asp Glu Ala Val Thr Arg Gly Cys Ala Leu Gin Cys Ala He Leu 
6X0 615 620 

Ser Pro Ala Phe Lys Val Arg Glu Phe Ser He Thr Asp Val Val Pro 
625 630 635 640 

Tyr Pro He Ser Leu Arg Trp Asn Ser Pro Ala Glu Glu Gly Ser Ser 
645 650 655// 

Asp Cys Glu Val Phe Ser Lys Asn His Ala Ala Pro Phe Ser Lys Val 
660 665 670 

Leu Thr Phe Tyr Arg Lys Glu Pro Phe Thr Leu Glu Ala Tyr Tyr Ser 
675 680 685 

Ser Pro Gin Asp Leu Pro Tyr Pro Asp Pro Ala He Ala Gin Phe Ser 
690 695 700 

Val Gin Lys Val Thr Pro Gin Ser Asp Gly Ser Ser Ser Lys Val Lys 
705 710 715 720 

Val Lys Val Arg Val Asn Val His Gly He Phe Ser Val Ser Ser Ala 
725 730 735 

Ser Leu Val Glu Val His Lys Ser Glu Glu Asn Glu Glu Pro Met Glu 
740 745 750 

Thr Asp Gin Asn Ala Lys Glu Glu Glu Lys Met Gin Val Asp Gin Glu 
755 760 765 

Glu Pro His Val Glu Glu Gin Gin Gin Gin Thr Pro Ala Glu Asn Lys 
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770 775 780 

Ala Glu Ser Qlu Glu Met Glu Thr Ser Gin Ala Gly Ser Lys Asp Lys 
785 790 795 800 

Lys Met Asp Gin Pro Pro Gin Cys Gin Glu Gly Lye Ser Glu Asp Gin 
805 810 815 

Tyr Cfys Gly Pro Ala Asn Arg Glu Ser Ala He Trp Gin He Asp Arg 
820 825 830 

Glu Met Leu Asn Leu Tyr He Glu Asn Glu Gly Lys Met He Met Gin 
835 840 845 

Asp Lys Leu Glu Lys Glu Arg Asn Asp Ala Lys Asn Ala Val Glu Glu 
850 855 860 

Tyr Val Tyr Glu Met Arg Asp Lys Leu Ser Gly Glu Tyr Glu Lys Phe 
865 870 875 880 

Val Ser Glu Asp Asp Arg Asn Ser Phe Thr Leu Lys Leu Glu Asp Thr 
885 890 895 

Glu Asn Trp Leu Tyr Glu Asp Gly^ Glu Asp Gin Pro Lys Gin Val Tyr 
900 905 910 

Val Asp Lys Leu Ala Glu Leu Lys Asn Leu Gly Gin Pro He Lys He 
915 920 925 

Arg Phe Gin Glu Ser Glu Glu Arg Pro Asn Tyr Leu Lys 
930 935 940 



<210> 173 
<211> 2674 

<212> DNA \' \; 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequience: GPP-HSC70 

<220> 

<221> CDS 

<222> (1) . , (2673) 

<400> 173 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro - He Leu 
1 5 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gee acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 
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tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



ctg acc tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

65 70 75 BO 

cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 



240 



288 



336 



gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 3 8^ 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

-aac tac aac age cac aac gtc tat ate atg gcc gac aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gee gac cac tac cag cag aac acc ccc ate ggc gac ggc 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gae aac cac tac ctg age acc cag tec gee ctg 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg ace gcc gee ggg ate act etc ggc atg gac gag ctg tac aag tec 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

gga etc aga tct atg tee aag gga cct gea gtt ggt att gat ett ggc 768 
Gly Leu Arg Ser Met Ser Lys Gly Pro Ala Val Gly He Asp Leu Gly 
245 250 255 

ace acc tac tct tgt gtg ggt gtt ttc cag cac gga aaa gtc gag at a 816 
Thr Thr Tyr Ser Cys Val Gly Val Phe Gin His Gly Lys Val Glu He 
260 265 270 



528 



576 



624 



672 



720 
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att gcc aat gat cag gga aac. cga acc act cca age tat gtc gcc ttt 864 
lie Ala Asn Asp Gin Gly Asn Arg Thr Thr Pro Ser Tyr Val Ala Phe 
275 280 285 

acg gac act gaa egg ttg ate ggt gat gcc gca aag aat caa gtt gca 912 
Thr Asp Thr Glu Arg Leu He Gly Asp Ala Ala Lys Asn Gin Val Ala 
290 295 300 

atg aac ccc acc aac aca gtt ttt gat gcc aaa cgt ctg att gga cgc 960 
Met Asn Pro Tlir Asn Thr Val Phe Asp Ala Lys Arg Leu He Gly Arg 
305 310 315 320 

aga ttt gat gat get gtt gtc cag tct gat atg aaa cat tgg ccc ttt 1008 
Arg Phe Asp Asp Ala Val Val Gin Ser Asp Met Lys His Trp Pro Phe 
325 330 335 

atg gtg gtg aat gat get gge agg ccc aag gtc caa gta gaa tac aag 1056 
Met Val Val Asn Asp Ala Gly Arg Pro Lys Val Gin Val Glu Tyr Lys 
340 345 350 

gga gag acc aaa age ttc tat cca gag gag gtg tct tet atg gtt ctg 1104 
Gly Glu Thr Lys Ser Phe Tyr Pro Glu Glu Val Ser Ser Met Val Leu 
355 360 365 

aca aag atg aag gaa att gca gaa gcc tac ctt ggg aag act gtt acc 1152 
Thr Lys Met Lys Glu He Ala Glu Ala Tyr Leu Gly Lys Thr Val Thr 
370 375 380 

aat get gtg gtc aca gtg cca get tac ttt aat gac tct cag cgt cag 1200 
Asn Ala Val Val Thr Val Pro Ala Tyr Phe Asn Asp Ser Gin Arg Gin 
385 390 395 400 

get acc aaa gat get gga act att get ggt etc aat gta ctt aga att 1248 
Ala Thr Lys Asp Ala Gly Thr He Ala Gly Leu Asn Val Leu Arg He 
405 410 415 

att aat gag cca act get get get att get tac gge tta gac aaa aag 1296 
He Asn Glu Pro Thr Ala Ala Ala He Ala Tyr Gly Leu Asp Lys Lys 
420 425 430 

gtt gga gca gaa aga aac gtg etc ate ttt gac ctg gga ggt gge act 1344 
Val Gly Ala Glu Arg Asn Val Leu He Phe Asp Leu Gly Gly Gly Thr 
435 440 445 

ttt gat gtg tea ate etc act att gag gat gga ate ttt gag gtc aag 13 92 
Phe Asp Val Ser He Leu Thr He Glu Asp Gly He Phe Glu Val Lys 
450 455 460 

tct aca get gga gac ace cac ttg ggt gga gaa gat ttt gac aac cga 1440 
Ser Thr Ala Gly Asp Thr His Leu Gly Gly Glu Asp Phe Asp Asn Arg 
465 470 475 480 

atg gtc aac cat ttt att get gag ttt aag cgc aag eat aag aag gac 1488 
Met Val Asn His Phe He Ala Glu Phe Lys Arg Lys His Lys Lys Asp 
485 490 495 

ate agt gag aac aag aga get gta aga cgc etc cgt act get tgt gaa 1536 
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lie Ser Glu Asn Lye Arg Ala Val Arg Arg Leu Arg Thr Ala Cys Glu 
500 505 510 

cgt get aag cgt acc etc tct tec age acc cag gee agt att gag ate 1584 
Arg Ala Lys Arg Thr Leu Ser Ser Ser Thr Gin Ala Ser lie Glu lie 
515 520 525 

gat tct etc tat gaa gga ate gac ttc tat acc tec att acc cgt gcc 1632 
Asp Ser Leu Tyr Glu Gly lie Asp Phe Tyr Thr Ser lie Thr Arg Ala. 
530 535 540 

cga ttt gaa gaa ctg aat get gac ctg ttc cgt ggc acc ctg gac cca 1680 
Arg Phe Glu Glu Leu Asn Ala Asp Leu Phe Arg Gly Thr Leu Asp Pro 
545 550 555 560 

gta gag aaa gcc ett cga gat gcc aaa eta gac aag tea cag att cat 1728 
Val Glu Lys Ala Leu Arg Asp Ala Lys Leu Asp Lys Ser Gin lie His 
565 570 575 

gat att gtc ctg gtt ggt ggt tct act cgt ate ccc aag att cag aag 1776 
Asp lie Val Leu Val Gly Gly Ser Thr Arg lie Pro Lys lie Gin Lys 
580 585 590 

Gtt etc caa gac ttc ttc aat gga aaa gaa ctg aat aag age ate aac 1824 
Leu Leu Gin Asp Phe Phe Asn Gly Lys Glu Leu Asn Lys Ser lie Asn 
595 600 605 

cct gat gaa get gtt get tat ggt gca get gtc cag gca gee ate ttg 1872 
Pro Asp Glu Ala Val Ala Tyr Gly Ala Ala Val Gin Ala Ala lie Leu 
610 615 620 

tct gga gac aag tct gag aat gtt caa gat ttg ctg etc ttg gat gtc 1920 
Ser Gly Asp Lys Ser Glu Asn Val Gin Asp Leu Leu Leu Leu Asp Val 
625 630 635 640 

act cct ett tee ctt ggt att gaa act get .ggt gga gtc atg act gtc 1968 
Thr Pro Leu Ser Leu Gly lie Glu Thr Ala Gly Gly Val Met Thr Val 
645 650 655 

etc ate aag cgt aat acc acc att cct acc aag cag aca cag acc ttc 2016 
Leu lie Lys Arg Asn Thr Thr lie Pro Thr Lys Gin Thr Gin Thr Phe 
660 665 670 

act acc tat tct gac aac cag cct ggt gtg ctt att cag gtt tat gaa 2 064 
Thr Thr Tyr Ser Asp Asn Gin Pro Gly Val Leu He Gin Val Tyr Glu 
675 680 685 

ggc gag cgt gee atg aca aag gat aac aac ctg ctt ggc aag ttt gaa 2112 
Gly Glu Arg Ala Met Thr Lys Asp Asn Asn Leu Leu Gly Lys Phe Glu 
690 695 700 

etc aca ggc ata cct cct gca ccc cga ggt gtt cct cag att gaa gtc 2160 
Leu Thr Gly He Pro Pro Ala Pro Tlrg Gly Val Pro Gin He Glu Val 
705 710 715 720 

act ttt gac att gat gee aat ggt ata etc aat gtc tct get gtg gac 2208 
Thr Phe Asp He Asp Ala Asn Gly He Leu Asn Val Ser Ala Val Asp 
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725 730 735 

aag agt acg gga aaa gag aac aag att act ate act aat gac aag ggc 2256 
Lys Ser Thr Gly Lys Glu Asn Lys He Thr He Thr Asn Asp Lys Gly 
740 745 750 

cgt ttg age aag gaa gac att gaa cgt atg gtc cag gaa get gag aag 2304 
Arg Leu Ser Lys Glu Asp He Glu Arg Met Val Gin Glu Ala Glu Lye 
755 760 765 

tac aaa get gaa gat gag aag cag agg gac aag gtg tea tee aag aat 2352 
Tyr Lys Ala Glu Asp Glu Lys Gin Arg Asp Lys Val Ser Ser Lys Asn 
770 775 780 

tea ett gag tec tat gee ttc aac atg aaa gca act gtt gaa gat gag 2400 
Ser Leu Glu Ser Tyr Ala Phe Asn Met Lys Ala Tiir Val Glu Asp Glu 
785 790 795 800 

aaa ett caa ggc aag att aac gat gag gac aaa cag aag att etg gac 2448 
Lys Leu Gin Gly Lys He Asn Asp Glu Asp Lys Gin Lys He Leu Asp 
805 8X0 615 

aag tgt aat gaa att ate aac tgg ett gat aag aat cag act get gag 2496 
Lys Cys Asn Glu He He Asn Trp Leu Asp Lys Asn Gin Thr Ala Glu 
820 825 830 

aag gaa gaa ttt gaa cat caa cag aaa gag ctg gag aaa gtt tgc aac 2544 
Lys Glu Glu Phe Glu His Gin Gin Lys Glu Leu Glu Lys Val Cys Asn 
835 840 845 



cec ate ate acc aag ctg tac cag agt gca gga ggc atg cea gga gga 2592 
Pro He He Thr Lys Leu Tyr Gin Ser Ala Gly Gly Met Pro Gly Gly 
850 655 860 

atg cct ggg gga ttt ect ggt ggt gga get cct ccc tet ggt ggt get 2640 
Met Pro Gly Gly Phe Pro Gly Gly Gly Ala .Pro Pro Ser Gly Gly Ala 
865 870 875 880 



tec tea ggg ccc acc att gaa gag gtt gat taa g 2674 
Ser Ser Gly Pro Thr He Glu Glu Val Asp 
885 890 



<210> 174 
<211> 890 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: GFP-HSC7p 
<400> 174 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 



145 



wo 00/50872 PCT/USOO/04794 



Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Tlir Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 9S 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 130 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Met Ser Lys Gly Pro Ala Val Gly He Asp Leu Gly 
245 250 255 

Thr Thr Tyr Ser Cys Val Gly Val Phe Gin His Gly Lys Val Glu He 
260 265 270 

He Ala Asn Asp Gin Gly Asn Arg Thr Thr Pro Ser Tyr Val Ala Phe 
275 280 285 

Thr Asp Thr Glu Arg Leu He Gly Asp Ala Ala Lys Asn Gin Val Ala 
290 295 300 

Met Asn Pro Thr Asn Thr Val Phe Asp Ala Lys Arg Leu He Gly Arg 
305 310 315 320 

Arg Phe Asp Asp Ala Val Val Gin Ser Asp Met Lys His Trp Pro Phe 
325 330 335 



146 



wo 00/50872 PCT/USOO/04794 



Met Val Val Asn Asp 
340 

Gly Glu Tlir Lys Ser 
355 

Thr Lys Met Lys Glu 
370 

Asn Ala Val Val Thr 
385 

Ala Thr Lys Asp Ala 
405 

lie Asn Glu Pro Thr 
420 

Val Gly Ala Glu Arg 
435 

Phe Asp Val Ser lie 
450 

Ser Thr Ala Gly Asp 
465 

Met Val Asn His Phe 
485 

lie Ser Glu Asn Lys 
500 

Arg Ala Lys Arg Thr 
515 

Asp Ser Leu Tyr Glu 
530 

Arg Phe Glu Glu Leu 
545 

Val Glu Lys Ala Leu 
565 

Asp lie Val Leu Val 
580 

Leu Leu Gin Asp Phe 
595 

Pro Asp Glu Ala Val 
610 

Ser Gly Asp Lys Ser 
625 



Ala Gly Arg Pro Lys 
345 

Phe Tyr Pro Glu Glu 
360 

He Ala Glu Ala Tyr 
375 

Val Pro Ala Tyr Phe 
390 

Gly Thr He Ala, Gly 
410 

Ala Ala Ala He Ala 
425 

Asn Val Leu He Phe 
440 

Leu Thr He Glu Asp 
455 

Thr His Leu Gly Gly 
470 

He Ala Glu Phe Lys 
490 

Arg Ala Val Arg Arg 
505 

Leu Ser Ser Ser Thr 
520 

Gly He Asp Phe Tyr 
535 

Asn Ala Asp Leu Phe 
550 

Arg Asp Ala Lys Leu 
570 

Gly Gly Ser Thr Arg 
585 

Phe Asn Gly Lys Glu 
600 

Ala Tyr Gly Ala Ala 
615 

Glu Asn Val Gin Asp 
630 



Val Gin Val Glu Tyr Lys 
350 

Val Ser Ser Met Val Leu 
365 

Leu Gly Lys Thr Val Thr 
380 

Asn Asp Ser Gin Arg Gin 
395 400 

Leu Asn Val Leu Arg He 
415 

Tyr Gly Leu Asp Lys Lys 
430 

Asp Leu Gly Gly Gly Thr 
445 

Gly He Phe Glu Val Lys 
460 

Glu Asp Phe Asp Asn Arg 
475 480 

Arg Lys His Lys Lys Asp 
495 

Leu Arg Thr Ala Cys Glu 
510 

Gin Ala Ser He Glu He 
525 

Thr Ser He Thr Arg Ala 
540 

Arg Gly Thr Leu Asp Pro 
555 560 

Asp Lys Ser Gin He His 
575 

He Pro Lys He Gin Lys 
590 

Leu Asn Lys Ser He Asn 
605 

Val Gin Ala Ala He Leu 
620 

Leu Leu Leu Leu Asp Val 
635 640 
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Thr Pro Leu Ser Leu Gly He Glu Thr Ala Gly Gly Val Met Thr Val 
645 650 655 

Leu He Lys Arg Asn Thr Thr He Pro Thr Lys Gin Thr Gin Thr Phe 
660 665 670 

Thr Thr Tyr Ser Asp Asn Gin Pro Gly Val Leu He Gin Val Tyr Glu 
675 680 685 

Gly Glu Arg Ala Met Thr Lys Asp Asn Asn Leu Leu Gly Lys Phe Glu 
690 695 700 

Leu Thr Gly He Pro Pro Ala Pro Arg Gly Val Pro Gin He Glu Val 
705 710 715 720 

Thr Phe Asp He Asp TU-a Asn Gly He Leu Asn Val Ser Ala Val Asp 
725 730 735 

Lys Ser Thr Gly Lys Glu Asn Lys He Thr He Thr Asn Asp Lys Gly 
740 745 750 

Arg Leu Ser Lys Glu Asp He Glu Arg Met Val Gin Glu Ala Glu Lys 
755 760 765 

Tyr Lys Ala Glu Asp Glu Lys. Gin Arg Asp Lys Val Ser Ser Lys Asn 
. 770 7T5 780 

Ser Leu Glu Ser Tyr Ala Phe Asn Met Lys Ala Thr Val Glu Asp Glu 
785 790 795 BOO 

Lys Leu Gin Gly Lys He Asn T^p Glu Asp Lys Gin Lys He Leu Asp 
805 810 815 

Lys Cys Asn Glu He He Asn Trp Leu Asp Lys Asn Gin Thr Ala Glu 
820 825 . 830 

Lys Glu Glu Phe Glu His Gin Gin Lys Glu Leu Glu Lys Val Cys Asn 
835 840 845 

Pro He He Thr Lys Leu Tyr Gin Ser Ala Gly Gly Met Pro Gly Gly 
850 855 860 

Met Pro Gly Gly Phe Pro Gly Gly Gly Ala Pro Pro Ser Gly Gly Ala 
865 870 875 880 

Ser Ser Gly Pro Thr He Glu Glu Val Asp 
885 890 



<210> 175 
<211> 2458 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-HSFl 
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<220> 

<221> CDS 

<222> (1) . . (2349) 

<400> 175 



aac tac aac age cac aac gtc tat ate atg gcc gac aag cag aag aac 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 



ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gcc ctg 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 



4B 



96 



atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc ate 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

tgc ace acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thx* 
50 55 60 

ctg acc tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 240 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg' Tyr Pro Asp His Met Lys 

65 • 70 75 80 

cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Axg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 12 0 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 



28B 



336 



480 



ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gcc gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
IBO 185 190 



624 
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age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gcc ggg ate act etc ggc atg gae gag ctg tac aag tec 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 

225 230 235 240 



720 



816 



864 



gga etc aga tct cga get caa get teg aat tct gca gtc gag atg gat 768 
Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Ala Val Glu Met Asp 
245 250 255 

ctg ccc gtg ggc ccc ggc gcg gcg ggg ccc age aac gtc ccg gcc ttc 
Leu Pro Val Gly Pro Gly Ala Ala Gly Pro Ser Asn Val Pro Ala Phe 
260 265 270 

ctg acc aag ctg tgg acc etc gtg age gac ccg gac acc gac gcg etc 
Leu Thr Lys Leu Trp Thr Leu Val Ser Asp Pro Asp Thr Asp Ala Leu 
275 * 280 285 

ate tgc tgg age ccg age ggg aac age ttc cac gtg ttc gac cag ggc 912 
He Cys Trp Ser Pro Ser Gly Asn Ser Phe His Val Phe Asp Gin Gly 
290 295 300 

cag ttt gee aag gag gtg ctg ccc aag tac ttc aag cac aac aac atg 
Gin Phe Ala Lys Glu Val Leu Pro Lys Tyr Phe Lys His Asn Asn Met 
305 310 315 320 

gcc age ttc gtg egg cag etc aac atg tat ggc ttc egg aaa gtg gtc 
Ala Ser Phe Val Arg Gin Leu Asn Met Tyr Gly Phe Arg Lys Val Val 
325 330 335 

cac ate gag cag ggc ggc ctg gtc aag eca gag aga gae gae acg gag 
His He Glu Gin Gly Gly Leu Val Lys Pro Glu Arg Asp Asp Thr Glu 
340 345 . 350 

ttc cag cac eca tgc ttc ctg cgt ggc cag gag cag etc ett gag aac 
Phe Gin His Pro Cys Phe Leu Arg Gly Gin Glu Gin Leu Leu Glu Asn 
355 360 365 

ate aag agg aaa gtg acc agt gtg tec acc ctg aag agt gaa gac ata 
He Lys Arg Lys Val Thr Ser Val Ser Thr Leu Lys Ser Glu Asp He 
370 375 380 

aag ate cgc cag gac age gtc acc aag ctg ctg acg gac gtg cag . ctg 
Lys He Arg Gin Asp Ser Val Thr Lys Leu Leu Thr Asp Val Gin Leu 
385 390 395 400 

atg aag ggg aag cag gag tgc atg gae tee aag etc ctg gcc atg aag 1248 
Met Lys Gly Lys Gin Glu Cys Met Asp Ser Lys Leu Leu Ala Met Lys 
405 410 415 

cat gag aat gag get ctg tgg egg gag gtg gcc age ctt egg cag aag 1296 
His Glu Asn Glu Ala Leu Trp Arg Glu Val Ala Ser Leu Arg Gin Lys 
420 425 430 
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1104 



1152 
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cat gcc cag caa cag aaa gtc gtc aac aag etc att cag ttc ctg ate 1344 
His Ala Gin Gin Gin Lys Val Val Asn Lys Leu He Gin Phe Leu He 
435 440 445 

tea ctg gtg cag tea aac egg ate ctg ggg gtg aag aga aag ate ccc 1392 
Ser Leu Val Gin Ser Asn Arg He Leu Gly Val Lys Arg Lys He Pro 
450 455 460 

ctg atg ctg aac gac agt ggc tea gca cat tec atg ccc aag tat age 1440 
Leu Met Leu Asn Asp Ser Gly Ser Ala His Ser Met Pro Lys Tyr Ser 
465 470 475 480 

egg cag ttc tec ctg gag cac gtc cac ggc teg ggc ccc tac teg gcc 1488 
Arg Gin Phe Ser Leu Glu His Val His Gly Ser Gly Pro Tyr Ser Ala 
485 490 495 

ccc tec oca gcc tac age age tec age etc tac gcc cct gat. get gtg 1536 
Pro Ser Pro Ala Tyr Ser Ser Ser Ser Leu Tyr Ala Pro Asp Ala Val 
500 505 510 

gcc age tct gga ccc ate ate tec gac ate ace gag ctg get cct gcc 1584 
Ala Ser Ser Gly Pro He He Ser Asp He Thr Glu Leu Ala Pro Ala 
515 520 525 

age ccc atg gcc tec ccc ggc ggg age ata gac gag agg ccc eta tec 1632 
Ser Pro Met Ala Ser Pro Gly Gly Ser He Asp Glu Arg Pro Leu Ser 
530 535 540 

age age ccc ctg gtg cgt gtc aag gag gag ccc ccc age ecg cct cag 1680 
Ser Ser Pro Leu Val Arg Val Lys Glu Glu Pro Pro Ser Pro Pro Gin 
545 550 555 560 

age ccc egg gta gag gag gcg agt ccc ggg cgc eca tct tee gtg gac 1728 
Ser Pro Arg Val Glu Glu Ala Ser Pro^Gly Arg Pro Ser Ser Val Asp 
565 570 575 

acc etc ttg tec ccg acc gcc etc att gac tec ate ctg egg gag agt 1776 
Thr Leu Leu Ser Pro Thr Ala Leu lie Asp Ser He Leu Arg Glu Ser 
580 585 590 

gaa cct gee ccc gee tec gtc aca gcc etc aeg gac gcc agg ggc cac 1824 
Glu Pro Ala Pro Ala Ser Val Thr Ala Leu Thr Asp Ala Arg Gly His 
595 600 605 

acg gac acc gag ggc egg cct ccc tec ccc ccg ccc acc tec acc cct .1872 
Thr Asp Thr Glu Gly Arg Pro Pro Ser Pro Pro Pro Thr Ser Thr Pro 
610 615 620 

gaa aag tgc etc age gta gcc tgc ctg gac aag aat gag etc agt gac 1920 
Glu Lys Cys Leu Ser Val Ala Cys Leu Asp Lys Asn Glu Leu Ser Asp 
625 630 635 640 

cac ttg gat get atg gac tec aac ctg gat aac ctg cag ace atg ctg 1968 
His Leu Asp Ala Met Asp Ser Asn Leu Asp Asn Leu Gin Thr Met Leu 
645 650 655 

age age cac ggc ttc age gtg gac acc agt gcc ctg ctg gac ctg ttc 2016 
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Ser Ser His Gly Phe Ser Val Asp Thr Ser Ala Leu Leu Asp Leu Phe 
660 S6S 670 

age ccc teg gtg acc gtg ccc gac atg age ctg cct gac ctt gac age 

Ser Pro Ser Val Thr Val Pro Asp Met Ser Leu Pro Asp Leu Asp Ser 
675 6B5 



2054 



age ctg gee agt ate caa gag etc ctg tct ccc cag gag ccc ccc agg 2112 
Ser Leu Ala Ser He Gin Glu Leu Leu Ser Pro Gin Glu Pro Pro Arg 
690 695 700 

cct ccc gag gca gag aac age age ccg gat tea ggg aag cag ctg gtg 2160 
Pro Pro Glu Ala Glu Asn Ser Ser Pro Asp Ser Gly Lys Gin Leu Val 
705 710 715 720 

cae tac aca gcg cag ccg ctg ttc ctg ctg gac ccc ggc tec gtg gac 2208 
His Tyr Thr Ala Gin Pro Leu Phe Leu Leu Asp Pro Gly Ser Val Asp 
725 730 735 

acc ggg age aac gac ctg ccg gtg ctg ttt gag ctg gga gag ggc tec 2256 
Thr Gly Ser Asn Asp Leu Pro Val Leu Phe Glu Leu Gly Glu Gly Ser 
740 745 750 

tac ttc tec gaa ggg gac ggc ttc gcc gag gac ccc acc ate tec ctg 2304 
Tyr Phe Ser Glu Gly Asp Gly Phe Ala Glu Asp Pro Thr He Ser Leu 
755 760 - 765 

ctg aca ggc teg gag cct ccc aaa gcc aag gac ccc act gtc tec 2349 
Leu Thr Gly Ser Glu Pro Pro Lys Ala Lys Asp Pro Thr Val Ser 
770 775 780 

tagaggeccc ggaggagctg ggccagccge ccacceccac ccccagtgca gggctggtet 2409 

tggggaggca gggcagcctc gcggtcttgg gcactggtgg gtcggccgg 2458 



<210> 176 
<211> 783 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-HSFl 
<400> 176 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 ^2 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 
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Leu Thr Tyr Gly.Val Gin Cys Pl^® 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys, Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195- 200 -205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Ala Val Glu Met Asp 
245 250 255 

Leu Pro Val Gly Pro Gly Ala Ala Gly Pro Ser Asn Val Pro Ala Phe 
260 265 270 

Leu Thr Lys Leu Trp Thr Leu Val Ser Asp Pro Asp Thr Asp Ala Leu 
275 280 285 

He Cys Trp Ser Pro Ser Gly Asn Ser Phe His Val Phe Asp Gin Gly 
290 295 300 

Gin Phe Ala Lys Glu Val Leu Pro Lys Tyr Phe Lys His Asn Asn Met 
305 310 315 320 

Ala Ser Phe Val Arg Gin Leu Asn Met Tyr Gly Phe Arg Lys Val Val 
325 330 335 

His He Glu Gin Gly, Gly Leu Val Lys Pro Glu Arg Asp Asp Thr Glu 
340 345 350 

Phe Gin His Pro Cys Phe Leu Arg Gly Gin Glu Gin Leu Leu Glu Asn 
355 360 365 
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He Lys Arg Lys Val Thr Ser Val Ser Thr Leu Lys Ser Glu Asp He 
370 375 380 

Lys He Arg Gin Asp Ser Val Thr Lys Leu Leu Thr Asp Val Gin Leu 
385 390 295 40O 

Met Lys Gly Lys Gin Glu Cys Met Asp. Ser Lys Leu Leu Ala Met Lys 
405 410 415 

His Glu Asn Glu Ala Leu Trp Arg Glu Val Ala Ser Leu Arg Gin Lys 
420 425 430 

His Ala Gin Gin Gin Lys Val Val Asn Lys Leu He Gin Phe Leu He 
435 440 445 

Ser Leu Val Gin Ser Asn Arg He Leu Gly Val Lys Arg Lys He Pro 
450 455 460 

Leu Met Leu Asn Asp Ser Gly Ser Ala His Ser Met Pro Lys Tyr Ser 
465 470 475 480 

Arg Gin Phe Ser Leu Glu His Val His Gly Ser Gly Pro Tyr Ser Ala 
485 490 495 

Pro Ser Pro Ala Tyr Ser Ser Ser Ser Leu Tyr Ala Pro Asp Ala Val 
500 505 510. 

Ala Ser Ser Gly Pro He He Ser Asp He Thr Glu Leu Ala Pro Ala 
515 520 525 

Ser Pro Met Ala Ser Pro Gly Gly Ser He Asp Glu Arg Pro Leu Ser* 
530 535 ■ 540 

Ser Ser Pro Leu Val Arg Val Lys Glu Glu Pro Pro Ser Pro Pro Gin 
545 550 555 560 

Ser Pro Arg Val Glu Glu Ala Ser Pro Gly Arg Pro Ser Ser Val Asp 
565 570 575 

Thr Leu Leu Ser Pro Thr Ala Leu He Asp Ser He Leu Arg Glu Sex 
580 585 590 

Glu Pro Ala Pro Ala Ser Val Thr Ala Leu Thr Asp Ala Arg Gly His 
595 600 605 

Thr Asp Thr Glu Gly Arg Pro Pro Ser Pro Pro Pro Thr Ser Thr Pro 
610 615 620 

Glu Lys Cys Leu Ser Val Ala Cys Leu Asp Lys Asn Glu Leu Ser Asp 
625 630 635 640 

His Leu Asp Ala Met Asp Ser Asn Leu Asp Asn Leu Gin Thr Met Leu 
645 650 655 

Ser Ser His Gly Phe Ser Val Asp Thr Ser Ala Leu Leu Asp Leu Phe 
660 665 670 
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Ser Pro Ser Val Thr Val Pro Asp Met Ser Leu Pro Asp Leu Asp Ser 
675 680 685 

Ser Leu Ala Ser lie Gin Glu Leu Leu Ser Pro Gin Glu Pro Pro Arg 
690 695 700 

Pro Pro Glu Ala Glu Asn Ser Ser Pro Asp Ser Gly Lys Gin Leu Val 
705 710 715 720 

His Tyr Thr Ala Gin Pro Leu Phe Leu Leu Aep Pro Gly Ser Val Asp 
725 730 735 

Thr Gly Ser Asn Asp Leu Pro Val Leu Phe Glu Leu Gly Glu Gly Ser 
740 745 750 

Tyr Phe Ser Glu Gly Asp Gly Phe Ala Glu Asp Pro Thr lie Ser Leu 
755 760 765 

Leu Thr Gly Ser Glu Pro Pro Lys Ala Lys Asp Pro Thr Val Ser 
770 775 780 



<210> 177 
<211> 2416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPP-NPKB 

<220> 

<221> CDS 

<222> (1) - . (24X5) 

<400> 177 

atg gtg age aag ggc gag gag ctg ttc acc .ggg gtg gtg ccc ate ctg 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
1 5 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg acc tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 



155 



wo 00/50872 



PCT/USOO/04794 



85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cae aag ctg gag tac 432 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gcc gac aag cag aag aac 480 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn. 

145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 



gtg cag etc gcc gac cac tae cag cag aac acc cec ate ggc gac ggc 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
ISO 185 190 



gga etc aga tct ega gat ccg cec ttc atg gae gaa ctg ttc ccc etc 
Gly Leu Arg Ser Arg Asp Pro Pro Phe Met Asp Glu Leu Phe Pro Leu 



245 250 255 



ggg cgc tee geg ggc age ate cea ggc gag agg age aca gat acc acc 

Gly Arg Ser Ala Gly Ser He Pro Gly Glu Arg Ser Thr Asp Thr Thr 

290 295 300 

aag acc cac cec acc ate aag ate aat ggc tae aca gga eca ggg aca 

Lys Thr His Pro Thr He Lys He Asn Gly Tyr Thr Gly Pro Gly Thr 
305 310 315 320 



576 



ccc gtg ctg ctg ccc gae aac cac tae ctg age ace cag tec gcc ctg- " 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa gac cec aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg ace gcc gee ggg ate act etc ggc atg gae gag ctg tae aag tec 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 



768 



ate ttc ccg gea gag cea gcc cag gcc tct ggc ccc tat gtg gagf ate 816 
He Phe Pro Ala Glu Pro Ala Gin Ala Ser Gly Pro Tyr Val Glu He 
260 265 270 

att gag cag cec aag cag egg ggc atg cgc ttc cgc tac aag tge gag 864 
He Glu Gin Pro Lys Gin Arg Gly Met Arg Phe Arg Tyr Lys Cys Glu 
275 280 285 



912 
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gtg cgc ate tec ctg gtc acc aag gac cct cct cac egg cct cac ccc 1008 
Val Arg lie Ser lieu Val Thr Lys Asp Pro Pro His Arg Pro His Pro 
325 330 335 

cac gag ctt gta gga aag gac tgc egg gat ggc ttc tat gag get gag 1056 
His Glu Leu Val Gly Lys Asp Cys Arg Asp Gly Phe Tyr Glu Ala Glu 
340 345 350 

etc tgc ccg gac cgc tgc ate cac agt ttc cag aac ctg gga ate cag 1104 
Leu Cys Pro Asp Arg Cys He His Ser Phe Gin Asn Leu Gly lie Gin 
355 360 365 

tgt gtg aag aag egg gac ctg gag cag get ate agt cag cgc ate cag 1152 
Cys Val Lys Lys Arg Asp Leu Glu Gin Ala lie Ser Gin Arg lie Gin 
370 375 380 

acc aac aac aac ecc ttc caa gtt cct ata gaa gag cag egt ggg gac 1200 
Thr Asn Asn Asn Pro Phe Gin Val Pro lie Glu Glu Gin Arg Gly Asp 
385 390 395 400 

tac gac ctg aat get gtg egg etc tgc ttc cag gtg aca gtg egg gae 124 8 
Tyr Asp Leu Asn Ala Val Arg Leu Cys Phe Gin Val Thr Val Arg Asp 
405 410 415 

eea tea ggc agg ecc etc cgc ctg ccg cct gtc ctt tct cat cec ate 1296 
Pro Ser Gly Arg Pro Leu Arg Leu Pro Pro Val Leu Ser His Pro lie 
420 425 430 

ttt gac aat egt gee ccc aac act gcc gag etc aag ate tgc cga gtg 1344 
Phe Asp Asn Arg Ala Pro Asn Thr Ala Glu Leu Lys lie Cys Arg Val 
435 440 445 

aac cga aac tct ggc age tgc^ctc ggt ggg gat gag ate ttc eta ctg 13 92 
Asn Arg Asn Ser Gly Ser Cys Leu Gly Gly Asp Glu lie Phe Leu Leu 
450 455 , 460 

tgt gac aag gtg cag aaa gag gac att gag gtg tat ttc aeg gga cca 1440 
Cys Asp Lys Val Gin Lys Glu Asp lie Glu Val Tyr Phe Thr Gly Pro 
465 470 475 ' 480 

ggc tgg gag gcc cga ggc tec ttt teg caa get gat gtg cac cga caa 14 88 
Gly Trp Glu Ala Arg Gly Ser Phe Ser Gin Ala Asp Val His Arg Gin 
485 490 495 

gtg gcc att gtg ttc egg acc cct ecc tac gea gac ccc age ctg cag 1536 
Val Ala lie Val Phe Arg Thr Pro Pro Tyr Ala Asp Pro Ser Leu Gin 
500 505 510 

get cct gtg egt gtc tec atg cag ctg egg egg cct tec gac egg gag 1584 
Ala Pro Val Arg Val Ser Met Gin Leu Arg Arg Pro Ser Asp Arg Glu 
515 520 525 

etc agt gag ccc atg gaa ttc cag tac ctg cca gat aca gac gat egt 1632 
Leu Ser Glu Pro Met Glu Phe Gin Tyr Leu Pro Asp Thr Asp Asp Arg 
530 535 540 
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cac egg att gag gag aaa cgt aaa agg aca tat gag acc ttc aag age 1S80 
His Arg lie Glu Glu Lys Arg Lys Arg Thr Tyr Glu Thr Phe Isys Ser 
545 550 555 560 

ate atg aag aag agt cct ttc age gga ccc acc gac ccc egg eet eca 1728 
lie Met Lye Lys Ser Pro Phe Ser Gly Pro Thr Asp Pro Arg Pro Pro 
565 570 575 

cct cga cgc att get gtg cct tec cgc age tea get tet gtc ccc aag 1776 
Pro Arg Arg lie Ala Val Pro Ser Arg Ser Ser Ala Ser Val Pro Lys 
580 585 590 

eca gca ccc cag ccc tat ccc ttt acg tea tec ctg age acc ate aac 1824 
Pro Ala Pro Gin Pro Tyr Pro Phe Thr Ser Ser Leu Ser Thr lie Asn 
595 600 605 

tat gat gag ttt ccc ace atg gtg ttt cct tet ggg cag ate age cag 1872 
Tyr Asp Glu Phe Pro Thr Met Val Phe Pro Ser Gly Gin lie Ser Gin 
610 615 620 



gee teg gcc ttg gee eeg gee cct ccc caa gtc ctg ccc. cag get eca 
Ala Ser Ala Leu Ala Pro Ala Pro Pro Gin Val Leu Pro Gin Ala Pro 
625 630 635 640 

gcc cct gcc cct get eca gee atg gta tea get ctg gee cag gcc eca 
Ala Pro Ala Pro Ala Pro Ala Met Val Ser Ala Leii Ala Gin Ala Pro 
645 650 655 

gcc cct . gtc eca gtc eta gee eca ggc cct cct cag get gtg gcc eca 
Ala Pro Val Pro Val Leu Ala Pro Gly Pro Pro Gin Ala Val Ala Pro 
660 665 670 



gtg aca gee cag agg ccc ccc gac eca get cct get eca ctg ggg gcc 
Val Thr Ala Gin Arg Pro Pro Asp Pro Ala Pro Ala Pro Leu Gly Ala 
755 760 765 

ccg ggg etc ccc aat ggc etc ctt tea gga gat gaa gac ttc tee tec 



1920 



1968 



2016 



cct gcc ccc aag ccc ace cag get ggg gaa gga acg ctg tea gag gcc 2064 

Pro Ala Pro Lys Pro Thr Gin Ala Gly Glu Gly Thr Leu Ser Glu Ala 
675 680 685 

ctg ctg cag ctg cag ttt gat gat gaa gac ctg ggg gee ttg ctt ggc 

Leu Leu Gin Leu Gin Phe Asp Asp Glu Asp Leu Gly hla Leu Leu Gly 
690 695 700 



2112 



aac age aca gac eca get gtg ttc aca gac ctg gca tec gtc gac aac 2160 
Asn Ser Thr Asp Pro Ala Val Phe Thr Asp Leu Ala Ser Val Asp Asn 
705 710 715 720 

tec gag ttt cag cag ctg ctg aac cag ggc ata cct gtg gcc ccc cac 2208 
Ser Glu Phe Gin Gin Leu Leu Asn Gin Gly lie Pro Val Ala Pro His 
725 730 735 

aca act gag ccc atg ctg atg gag tac cct gag get ata act cgc eta 2256 
Thr Thr Glu Pro Met Leu Met Glu Tyr Pro Glu Ala He Thr Arg Leu 
740 745 750 
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Pro Gly Leu Pro Asn Gly Leu Leu Ser Gly Asp Glu Asp Phe Ser Ser 
770 775 780 

att gcg gac atg gac ttc tea gcc ctg ctg agt cag ate age tec aag 24 00 
lie Ala Asp Met Asp Phe Ser Ala Leu Leu Ser Gin He Ser Ser Lys 
785 790 795 BOO 

ggc gaa ttc gaa get t 2416 
Gly Glu Phe Glu Ala 
805 



<210> 178 
<211> 805 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-NFKB 
<400> 17B. 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15^ 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 . 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125- 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro Xle Gly Asp Gly 
180 185 190 
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Pro Val Leu Leu Pro 
X95 

5er Lys Asp Pro Asn 
210 

Val Thr Ala Ala Gly 
225 

Gly Leu Arg Ser Arg 
245 

He Phe Pro Ala Glu 
260 

He Glu Gin Pro Lys 
275 

Gly Arg Ser Ala Gly 
290 

Lys Thr His Pro Tbx 
305 

Val Arg He Ser Leu 
325 

His Glu Leu Val Gly 
340 

Leu Cys Pro Asp Arg 
355 

Cys Val Lys Lys Arg 
370 

Thr Asn Asn Asn Pro 
385 

Tyr Asp Leu Asn Ala 
405 

Pro Ser Gly Arg Pro 
420 

Phe Asp Asn Arg Ala 
435 

Asn Arg Asn Ser Gly 
450 

Cys Asp Lys Val Gin 
465 

Gly Trp Glu Ala Arg 
485 



Asp Asn His Tyr Leu Ser 
200 

Glu Lys Arg Asp His Met 
. 215 

He Thr Leu Gly Met Asp 
230 235 

Asp Pro Pro Phe Met Asp 
250 

Pro Ala Gin Ala Ser Gly 
265 

Gin Arg Gly Met Arg Phe 
280 . 

Ser He Pro Gly Glu Arg 
295 

He Lys He Asn Gly Tyr 
310 315 

Val Thr Lys Asp Pro Pro 
330 

Lys Asp Cys Arg Asp Gly 
345 

Cys He His Ser Phe Gin 
360 

Asp Leu Glu Gin Ala He 
375 

Phe Gin Val Pro He Glu 
390 395 

Val Arg Leu Cys Phe Gin 
410 

Leu Arg Leu Pro Pro Val 
425 

Pro Asn Thr Ala Glu Leu 
440 

Ser Cys Leu Gly Gly Asp 
455 

Lys Glu Asp He Glu Val 
470 475 

Gly Ser Phe Ser Gin Ala 
490 



Thr Gin Ser Ala Leu 
205 

Val Leu Leu Glu Phe 
220 

Glu Leu Tyr Lys Ser 
240 

Glu Leu Phe Pro Leu 
255 

Pro Tyr Val Glu He 
270 

Arg Tyr Lys Cys Glu 
285 

Ser Th.r Asp Thr Thr 
300 

Thr Gly Pro Gly Thr 
320 

His Arg Pro His Pro 
335 

Phie Tyr Glu Ala Glu 
350 

Asn Leu Gly He Gin 
365 

Ser Gin Arg He Gin 
380 

Glu Gin Arg Gly Asp 
400 

Val Thr Val Arg Asp 
415 

Leu Ser His Pro He 
430 

Lys He Cys Arg Val 
445 

Glu He Phe Leu Leu 
460 

Tyr Phe Thr Gly Pro 
480 

Asp Val His Arg Gin 
495 
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Val Ala He Val Phe Arg Tiir Pro Pro Tyr Ala Asp Pro Ser Leu Gin 
500 505 510 

Ala Pro Val Arg Val Ser Met Gin Leu Arg Arg Pro Ser Asp Arg Glu 
515 520 525 

Leu Ser Glu Pro Met Glu Phe Gin Tyr Leu Pro Asp Thr Asp Asp Arg 
530 535 540 

His Arg He Glu Glu Lys Arg Lys Arg Tlxr Tyr Glu Thr Phe Lys Ser 
545 550 555 560 

He Met Lys Lys Ser Pro Phe Ser Gly Pro Thr Asp Pro Arg Pro Pro 
565 570 575 

Pro Arg Arg He Ala Val Pro Ser Arg Ser Ser Ala Ser Val Pro Lys 
580 585 590 

Pro Ala Pro Gin Pro Tyr Pro Phe Thr Ser Ser Leu Ser Thr He Asn 
595 600 605 

Tyr Asp Glu Phe Pro Thr Met Val Phe Pro Ser Gly Gin He Ser Gin 
610 615 620 

Ala Ser Ala Leu Ala Pro Ala Pro Pro Gin Val Leu Pro Gin Ala Pro 
625 630 635 . 640 

Ala Pro Ala Pro Ala Pro Ala Met Val Ser Ala Leu Ala Gin Ala Pro 
645 650 655 

Ala Pro Val Pro Val Leu Ala Pro Gly Pro Pro Gin Ala Val Ala Pro 
660 665 670 

Pro AFa Pro Lys Pro Thr Gin Ala Gly Glu Gly Thr Leu Ser Glu Ala 
675 680 685 

Leu Leu Gin Leu Gin Phe Asp Asp Glu Asp Leu Gly Ala Leu Leu Gly 
690 695 700 

Asn Ser Thr Asp Pro Ala Val Phe Thr Asp Leu Ala Ser Val Asp Asn 
705 710 715 720 

Ser Glu Phe Gin Gin Leu Leu Asn Gin Gly He Pro Val Ala Pro His 
725 730 735 

Thr Thr Glu Pro Met Leu Met Glu Tyr Pro Glu Ala He Thr Arg Leu 
740 745 750 

Val Thr Ala Gin Arg Pro Pro Asp Pro Ala Pro Ala Pro Leu Gly Ala 
755 760 765 

Pro Gly Leu Pro Asn Gly Leu Leu Ser Gly Asp Glu Asp Phe Ser Ser 
770 775 780 

He Ala Asp Met Asp Phe Ser Ala Leu Leu Ser Gin He Ser Ser Lys 
785 790 795 800 
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Gly Glu Phe Glu Ala 
805 



48 



96 



<210> 179 

<211> 1677 

<212> DKTA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-IKB 

<220> . - 

<221> CDS 
<222> (1) , . (1674) 

<400> 179 

atg ttc cag gcg get gag cgc ccc cag gag tgg gcc atg gag ggc ccc 

Met Phe Gin Ala Ala Glu Arg Pro Gin Glu Trp Ala Met Glu Gly Pro 
i 5 10 15 

cgc gac ggg ctg aag aag gag egg eta ctg gac gac cgc cac gac age 
Arg Asp Gly Leu Lys Lys Glu Arg Leu Leu Asp Asp Arg His Asp Ser 
20 25 30 

ggc ctg gac tec atg aaa gac gag gag tac gag cag atg gtc aag gag 144 
Gly Leu Asp Ser Met Lys Asp Glu Glu Tyr Glu Gin Met Val Lys Glu 
35 40 45 

ctg cag gag ate cgc etc gag ecg cag gag gtg ccg cge ggc teg gag 192 
Leu Gin Glu He Arg Leu Glu Pro Gin Glu Val Pro Arg Gly Ser Glu 
50 55 60 

ccc tgg aag cag cag etc ace gag gac ggg gac teg ttc ctg cac ttg 24 0 
Pro Trp Lys Gin Gin Leu Thr Glu Asp Gly Asp Ser Phe Leu His Leu 
65 70 . .75 80 

gcc ate ate cat gaa gaa aag gca ctg ace atg gaa gtg ate cgc cag 288 
Ala He He His Glu Glu Lys Ala Leu Thr Met Glu Val He Arg Gin 
85 90 95 

gtg aag gga gac ctg gcc ttc etc aac etc cag aac aac ctg cag cag 33 6 
Val Lys Gly Asp Leu Ala Phe Leu Asn Leu Gin Asn Asn Leu Gin Gin 
100 105 110 

act eca etc cac ttg get gtg ate ace aac cag cca gaa att get gag 3 84 
Thr Pro Leu His Leu Ala Val He Thr Asn Gin Pro Glu He Ala Glu 
115 120 125 

gca ctt ctg gga get ggc tgt gat cct gag etc cga gac ttt ega gga 432 
Ala Leu Leu Gly Ala Gly Cys Asp Pro Glu Leu Arg Asp Phe Arg Gly 
130 ■ 135 140 



aat ace ccc eta cac ctt gee tgt gag cag ggc tgc ctg gcc age gtg 
Asn Thr Pro Leu His Leu Ala Cys Glu Gin Gly Cys Leu Ala Ser Val 
145 150 155 160 
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gga gtc ctg act cag tec tgc acc acc ccg cac etc cac tec ate ttg 
Gly Val Leu Thr Gin Ser Cys Thr Thr Pro His Leu His Ser He Leu 
165 170 175 



528 



aag get acc aac tac aat ggc cac acg tgt eta cac tta gee tct ate 576 
Lys Ala Thr Asn Tyr Asn Gly His Thr Cys Leu His Leu Ala Ser He 
180 185 190 

cat ggc tac ctg ggc ate gtg gag ctt ttg gtg tec ttg ggt get gat 624 
His Gly Tyr Leu Gly He Val Glu Leu Leu Val Ser Leu Gly Ala Asp 
195 200 205 

gtc aat get cag gag ecc tgt aat ggc egg act gee ctt cac etc gca 672 
Val Asn Ala Gin Glu Pro Cys Asn Gly Arg Thr Ala Leu His Leu Ala 
210 215 220 



gtg gac ctg caa aat cet gac ctg gtg tea etc ctg ttg aag tgt ggg 
Val Asp Leu Gin Asn Pro Asp Leu Val Ser Leu Leu Leu Lys Cys Gly 
225 230 235 240 



tat gat gac tgt gtg ttt gga ggc cag cgt ctg acg tta acc ggt atg 

Tyr Asp Asp Cys Val Phe Gly Gly Gin Arg Leu Thr Leu Thr Gly Met 
305 310 315 320 

get age aaa gga gaa gaa etc ttc act gga gtt gtc cea att ctt gtt 

Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 

325 330 335 



ggt gaa ggt gat gca aca tac gga aaa ctt ace ctg aag ttc ate tgc 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
355 360 365 

act act ggc aaa ctg cet gtt cca tgg cca aca eta gtc act act ctg 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
370 375 380 

tgc tat ggt gtt caa tgc ttt tea aga tac ccg gat cat atg aaa egg 



720 



get gat gtc aac aga gtt acc tac cag ggc tat tct ecc tac cag etc 768 
Ala Asp Val Asn Arg. Val Thr Tyr Gin Gly Tyr Ser Pro Tyr Gin Leu 
245 250 255 

acc tgg ggc cgc cca age acc egg ata cag cag cag ctg ggc cag ctg 816 
Thr Trp Gly Arg Pro Ser Thr Arg He Gin Gin Gin Leu Gly Gin Leu 
260 265 270 

aca eta gaa aac ctt cag atg ctg cca gag agt gag gat gag gag age 864 
Thr Leu Glu Asn Leu Gin Met Leu Pro Glu Ser Glu Asp Glu Glu Ser 
275 280 285 

tat gac aca gag tea gag ttc acg gag ttc aca gag gac gag ctg ecc 912 
Tyr Asp Thr Glu B^x Glu Phe Thr Glu Phe Thr Glu Asp Glu Leu Pro 
290 295 300 



960 



1008 



gaa tta gat ggt gat gtt aac ggc cac aag ttc tct gtc agt gga gag 1056 
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
340 345 350 
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1152 



1200 



163 



wo 00/50872 



PCT/USOO/04794 



Cys Tyr Qly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg 
385 390 395 400 



cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa agg 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
405 410 415 



124B 



acc ate ttc ttc aaa gat gac ggc aac tac aag aca cgt get gaa gtc 
Thr He Phe Phe Lys Asp. Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
420 425 430 



1296 



aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt att 1344 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
435 440 445 

gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac aac 1392 
Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
45*0 455 460 



tat aac tea cac aat gta tac ate atg gca gad aaa caa aag aat gga 
Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
465 470 475 480 



1440 



ate aaa gtg aac ttc aag acc cgc cac aac att gaa gat gga age gtt 
He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser Val 
485 490 495 



1488 



caa eta gca gac cat tat caa caa aat act eca att ggc gat ggc cet 1536 
Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
500 505 510 

gtc ctt tta eca gac aac cat tac ctg tec aca caa tct gcc ctt teg 1584 
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
515 / / 520 525 

aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt gag ttt gta 1632 
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
530 535 540 

aca get get ggg att aca cat ggc atg gat gaa ctg tac aac tag 1677 
Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn 
545 550 555 



<210> 180 
<211> 558 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-IKB 



<400> 180 

Met Phe Gin Ala Ala Glu Arg Pro Gin Glu Trp Ala Met Glu Gly Pro 

1 5 10 15 

Arg Asp Gly Leu Lys Lys Glu Arg Leu Leu Asp Asp Arg His Asp Ser 
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20 25 30 

Gly Leu Asp Ser Met Lys Asp Glu Glu Tyr Glu Gin Met Val Lys Glu 
35 40 45 

Leu Gin Glu lie Arg Leu Glu Pro Gin Glu Val Pro Arg Gly Ser Glu 
50 55 60 

Pro Trp Lys Gin Gin Leu Tlir Glu Asp Gly Asp Ser Phe Leu His Leu 
65 70 75 80 

Ala lie He His Glu Glu Lys Ala Leu Thr Met Glu Val He Arg Gin 
85 90 95 

Val Lys Gly Asp Leu Ala Phe Leu Asn Leu Gin Asn Asn Leu Gin Gin 
100 105 110 

Thr Pro Leu His Leu Ala Val He Thr Asn Gin Pro Glu He Ala Glu 
115 120 125 

Ala Leu Leu Gly Ala Gly Cys Asp Pro Glu Leu Arg Asp Phe Arg Gly 
130 135 140 

Asn Thr Pro Leu His Leu Ala Cys Glu Gin Gly Cys Leu Ala Ser Val 
145 150 155* 160 

Gly Val Leu Thr Gin Ser Cys Thr Thr Pro His Leu His Ser He Leu 
165 170 175 

Lys Ala Thr Asn Tyr Asn Gly His Thr Cys Leu His Leu Ala Ser He 
180 185 190 

His Gly Tyr Leu Gly He Val Glu Leu Leu Val Ser Leu Gly Ala Asp 
195 ^200 205 

Val Asn Ala Gin Glu Pro Cys Asn Gly Arg Thr Ala Leu His Leu Ala 
210 215 220 

Val Asp Leu Gin Asn Pro Asp Leu Val Ser Leu, Leu Leu Lys Cys Gly 
225 230 235 240 

Ala Asp Val Asn Arg Val Thr Tyr Gin Gly Tyr . Ser Pro Tyr Gin Leu 
245 250 255 

Thr Txp Gly Arg Pro Ser Thr Arg He Gin Gin Gin Leu Gly Gin Leu 
260 265 270 

Thr Leu Glu Asn Leu Gin Met Leu Pro Glu Ser Glu Asp Glu Glu Ser 
275 280 285 

Tyr Asp Thr Glu Sex Glu Phe Thr Glu Phe Thr Glu Asp Glu Leu Pro 
290 295 300 

Tyr Asp Asp Cys Val Phe Gly Gly Gin Arg Leu Thr Leu Thr Gly Met 
305 310 315 320 

Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
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325 330 335 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
340 345 350 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
355 360 365 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
370 375 380 

Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg 
385 390 395 40O 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
405 410 415 

Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
420 425 430 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
435 440 445 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
450 455 460 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
465 470 475 480 

He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser Val 
485 490 495 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
500 505 510 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser .Thr Gin Ser Ala Leu Ser 
515 520 525 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
530 535 540 • 

Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn 
545 550 555 
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This International Searching Authority found multiple inventions In this international application, as follows: 



1 . pi As all required additional search fees were timely paid by the applicant this International Search Report covers all 
' — J searciiable claims. 

2. rn As all searchable claims could be searched without effort justifying an additional fee. this Authority did not in^^e payment 

of any additional fee. 

3 I — I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
I — I covers only those claims for which fees were paid, specifically claims Nos.: 



4 I I No required additional search fees were timely paid by the applicant. ConsequenUy, this International Search Report is 
■ I — I restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest Q The additional search fees were accompanied by the applicant's protest. 

j ^ No protest accompanied the payment of additional search fees. 
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there is lack of clarity as it is unclear if the instructions detailing 
the method 

are to be used to define the kit (not allowable as a method may not be 
used t 

define a product) or alternatively were to be regarded as a sheet of 
paper, F 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions In respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) POT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched- This is • 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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